COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS
OF BREAST CANCER AND METHODS FOR THEIR USE 

REFERENCE TO RELATED APPLICATIONS 
This application is a continuation-in-part of U.S. Patent Application
No. 09/118,627, filed July 17, 1998, which is a continuation-in-part of U.S. Patent
Application No. 08/998,253, filed December 24, 1997. 

TECHNICAL FIELD 

The present invention relates generally to compositions and methods for
the treatment and diagnosis of breast cancer. The invention is more particularly related
to polypeptides comprising at least a portion of a protein that is preferentially expressed
in breast tumor tissue and to polynucleotide molecules encoding such polypeptides.
Such polypeptides may be used in vaccines and pharmaceutical compositions for
treatment of breast cancer. Additionally, such polypeptides and polynucleotides may be
used in the immunodiagnosis of breast cancer. 

BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United
States and throughout the world. Although advances have been made in detection and
treatment of the disease, breast cancer remains the second leading cause of cancer-related
deaths in women, affecting more than 180,000 women in the United States each
year. For women in North America, the lifetime odds of getting breast cancer are now
one in eight.

No vaccine or other universally successful method for the prevention or
treatment of breast cancer is currently available. Management of the disease currently
relies on a combination of early diagnosis (through routine breast screening procedures)
and aggressive treatment, which may include one or more of a variety of treatments
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of
treatment for a particular breast cancer is often selected based on a variety of prognostic
parameters, including an analysis of specific tumor markers. See, e.g., Porter-Jordan
and Lippman, Breast Cancer 8:73-100 (1994). However, the use of established markers
often leads to a result that is difficult to interpret, and the high mortality observed in 
breast cancer patients indicates that improvements are needed in the treatment, 
diagnosis and prevention of the disease. 
Accordingly, there is a need in the art for improved methods for therapy
and diagnosis of breast cancer. The present invention fulfills these needs and further
provides other related advantages. 

SUMMARY OF THE INVENTION 

The present invention provides compounds and methods for
immunotherapy of breast cancer. In one aspect, isolated polypeptides are provided
comprising at least an immunogenic portion of a breast tumor protein or a variant of
said protein that differs only in conservative substitutions and/or modifications, wherein
the breast tumor protein comprises an amino acid sequence encoded by a polynucleotide
molecule having a partial sequence selected from the group consisting of (a) nucleotide
sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45-52 and 55-67, 72, 73, and 89-94, 
(b) complements of said nucleotide sequences and (c) sequences that hybridize to a 
sequence of (a) or (b) under moderately stringent conditions. 

In related aspects, isolated polynucleotide molecules encoding the above
polypeptides are provided. In specific embodiments, such polynucleotide molecules
have partial sequences provided in SEQ ID NOS: 3, 10, 17, 24, 45-52 and 55-67, 72,
73, and 89-94. The present invention further provides expression vectors comprising
the above polynucleotide molecules and host cells transformed or transfected with such
expression vectors. In preferred embodiments, the host cells are selected from the
group consisting of E. coli, yeast and mammalian cells.

In another aspect, the present invention provides fusion proteins 
comprising a first and a second inventive polypeptide or, alternatively, an inventive 
polypeptide and a known breast antigen. 

The present invention also provides pharmaceutical compositions
comprising at least one of the above polypeptides, or a polynucleotide molecule
encoding such a polypeptide, and a physiologically acceptable carrier, together with
vaccines comprising one or more such polypeptides or polynucleotide molecules
in combination with a non-specific immune response enhancer. Pharmaceutical 
compositions and vaccines comprising one or more of the above fusion proteins are also 
provided. 

In related aspects, pharmaceutical compositions for the treatment of
breast cancer comprising at least one polypeptide and a physiologically acceptable
carrier are provided, wherein the polypeptide comprises an immunogenic portion of a
breast tumor protein or a variant thereof, the breast tumor protein being encoded by a
polynucleotide molecule having a partial sequence selected from the group consisting
of: (a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53,
54, 68-71, and 74-88, (b) complements of said nucleotide sequences, and (c) sequences
that hybridize to a sequence of (a) or (b) under moderately stringent conditions. The
invention also provides vaccines for the treatment of breast cancer comprising such
polypeptides in combination with a non-specific immune response enhancer, together
with pharmaceutical compositions and vaccines comprising at least one polynucleotide
molecule having a partial sequence provided in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23,
25-44, 53, 54, 68-71, and 74-88.

In yet another aspect, methods are provided for inhibiting the
development of breast cancer in a patient, comprising administering an effective amount
of at least one of the above pharmaceutical compositions and/or vaccines.

The present invention also provides methods for immunodiagnosis of 
breast cancer, together with kits for use in such methods. In one specific aspect of the 
present invention, methods are provided for detecting breast cancer in a patient, 
comprising: (a) contacting a biological sample obtained from a patient with a binding
agent that is capable of binding to one of the inventive polypeptides; and (b) detecting
in the sample a protein or polypeptide that binds to the binding agent. In preferred 
embodiments, the binding agent is an antibody, most preferably a monoclonal antibody. 

In related aspects, methods are provided for monitoring the progression 
of breast cancer in a patient, comprising: (a) contacting a biological sample obtained
from a patient with a binding agent that is capable of binding to one of the above
polypeptides; (b) determining in the sample an amount of a protein or polypeptide that
binds to the binding agent; (c) repeating steps (a) and (b); and (d) comparing the amounts
of polypeptide detected in steps (b) and (c).

Within related aspects, the present invention provides antibodies,
preferably monoclonal antibodies, that bind to the inventive polypeptides, as well as
diagnostic kits comprising such antibodies, and methods of using such antibodies to 
inhibit the development of breast cancer. 

The present invention further provides methods for detecting breast 
cancer comprising: (a) obtaining a biological sample from a patient; (b) contacting the
sample with a first and a second oligonucleotide primer in a polymerase chain reaction,
at least one of the oligonucleotide primers being specific for a DNA molecule that 
encodes one of the above polypeptides; and (c) detecting in the sample a DNA sequence 
that amplifies in the presence of the first and second oligonucleotide primers. In a 
preferred embodiment, at least one of the oligonucleotide primers comprises at least
about 10 contiguous nucleotides of a DNA molecule having a partial sequence selected
from the group consisting of SEQ ID NOS: 1-94. 

In a further aspect, the present invention provides a method for detecting 
breast cancer in a patient comprising: (a) obtaining a biological sample from the 
patient; (b) contacting the sample with an oligonucleotide probe specific for a
polynucleotide molecule that encodes one of the above polypeptides; and (c) detecting
in the sample a polynucleotide sequence that hybridizes to the oligonucleotide probe. 
Preferably, the oligonucleotide probe comprises at least about 15 contiguous 
nucleotides of a DNA molecule having a partial sequence selected from the group 
consisting of SEQ ID NOS: 1-94. 

In related aspects, diagnostic kits comprising the above oligonucleotide
probes or primers are provided.

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
hereby incorporated by reference in their entirety as if each was incorporated
individually.
BRIEF DESCRIPTION OF THE DRAWINGS

Figs. 1A and 1B show the specific lytic activity of a first and a second B511S-specific
CTL clone, respectively, measured on autologous LCL transduced with B511S
(filled squares) or HLA-A3 (open squares).

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to
compositions and methods for the immunotherapy and diagnosis of breast cancer. The
inventive compositions are generally isolated polypeptides that comprise at least a 
portion of a breast tumor protein. Also included within the present invention are 
molecules (such as an antibody or fragment thereof) that bind to the inventive 
polypeptides. Such molecules are referred to herein as "binding agents." 

In particular, the subject invention discloses polypeptides comprising at
least a portion of a human breast tumor protein, or a variant thereof, wherein the breast
tumor protein includes an amino acid sequence encoded by a polynucleotide molecule
including a sequence selected from the group consisting of: nucleotide sequences
recited in SEQ ID NOS: 1-94, the complements of said nucleotide sequences, and
variants thereof. As used herein, the term "polypeptide" encompasses amino acid
chains of any length, including full length proteins, wherein the amino acid residues are
linked by covalent peptide bonds. Thus, a polypeptide comprising a portion of one of
the above breast proteins may consist entirely of the portion, or the portion may be
present within a larger polypeptide that contains additional sequences. The additional
sequences may be derived from the native protein or may be heterologous, and such
sequences may be immunoreactive and/or antigenic.

As used herein, an "immunogenic portion" of a human breast tumor
protein is a portion that is capable of eliciting an immune response in a patient afflicted
with breast cancer and as such binds to antibodies present within sera from a breast
cancer patient. Such immunogenic portions generally comprise at least about 5 amino
acid residues, more preferably at least about 10, and most preferably at least about 20
amino acid residues. Immunogenic portions of the proteins described herein may be
identified in antibody binding assays. Such assays may generally be performed using
any of a variety of means known to those of ordinary skill in the art, as described, for
example, in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY, 1988. For example, a polypeptide may be
immobilized on a solid support (as described below) and contacted with patient sera to
allow binding of antibodies within the sera to the immobilized polypeptide. Unbound
sera may then be removed and bound antibodies detected using, for example, ¹²⁵I-labeled
Protein A. Alternatively, a polypeptide may be used to generate monoclonal
and polyclonal antibodies for use in detection of the polypeptide in blood or other fluids
of breast cancer patients. Methods for preparing and identifying immunogenic portions
of antigens of known sequence are well known in the art and include those summarized
in Paul, Fundamental Immunology, 3rd ed., Raven Press, 1993, pp. 243-247.

The term "polynucleotide(s)," as used herein, means a single or double-stranded
polymer of deoxyribonucleotide or ribonucleotide bases and includes DNA
and corresponding RNA molecules, including HnRNA and mRNA molecules, both
sense and anti-sense strands, and comprehends cDNA, genomic DNA and recombinant
DNA, as well as wholly or partially synthesized polynucleotides. An HnRNA molecule
contains introns and corresponds to a DNA molecule in a generally one-to-one manner.
An mRNA molecule corresponds to an HnRNA and DNA molecule from which the
introns have been excised. A polynucleotide may consist of an entire gene, or any
portion thereof. Operable anti-sense polynucleotides may comprise a fragment of the
corresponding polynucleotide, and the definition of "polynucleotide" therefore includes
all such operable anti-sense fragments.

The compositions and methods of the present invention also encompass
variants of the above polypeptides and polynucleotides. A polypeptide "variant," as
used herein, is a polypeptide that differs from the recited polypeptide only in
conservative substitutions and/or modifications, such that the therapeutic, antigenic
and/or immunogenic properties of the polypeptide are retained. In a preferred
embodiment, variant polypeptides differ from an identified sequence by substitution,
deletion or addition of five amino acids or fewer. Such variants may generally be
identified by modifying one of the above polypeptide sequences, and evaluating the
antigenic properties of the modified polypeptide using, for example, the representative
procedures described herein. Polypeptide variants preferably exhibit at least about 70%,
more preferably at least about 90% and most preferably at least about 95% identity
(determined as described below) to the identified polypeptides.

As used herein, a "conservative substitution" is one in which an amino
acid is substituted for another amino acid that has similar properties, such that one
skilled in the art of peptide chemistry would expect the secondary structure and
hydropathic nature of the polypeptide to be substantially unchanged. In general, the
following groups of amino acids represent conservative changes: (1) ala, pro, gly, glu,
asp, gln, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his;
and (5) phe, tyr, trp, his.
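
As an illustrative sketch only, and not part of the disclosed methods, the five groups listed above can be encoded as sets and used to test whether a given substitution is conservative under this definition. The helper name `is_conservative` and the use of lowercase three-letter residue codes are assumptions made for the example.

```python
# Conservative-substitution groups (1)-(5) as listed in the text,
# written with three-letter residue codes.
CONSERVATIVE_GROUPS = [
    {"ala", "pro", "gly", "glu", "asp", "gln", "asn", "ser", "thr"},
    {"cys", "ser", "tyr", "thr"},
    {"val", "ile", "leu", "met", "ala", "phe"},
    {"lys", "arg", "his"},
    {"phe", "tyr", "trp", "his"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues share at least one group (hypothetical helper)."""
    a, b = original.lower(), replacement.lower()
    if a == b:
        return True  # identical residue, so nothing has changed
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

Because a residue may belong to more than one group (for example ser, thr, ala, phe, tyr and his), the check accepts a substitution as soon as any single group contains both residues.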

Variants may also, or alternatively, contain other modifications,
including the deletion or addition of amino acids that have minimal influence on the
antigenic properties, secondary structure and hydropathic nature of the polypeptide. For
example, a polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal
end of the protein which co-translationally or post-translationally directs
transfer of the protein. The polypeptide may also be conjugated to a linker or other
sequence for ease of synthesis, purification or identification of the polypeptide (e.g.,
poly-His), or to enhance binding of the polypeptide to a solid support. For example, a
polypeptide may be conjugated to an immunoglobulin Fc region.

A nucleotide "variant" is a sequence that differs from the recited
nucleotide sequence in having one or more nucleotide deletions, substitutions or
additions. Such modifications may be readily introduced using standard mutagenesis
techniques, such as oligonucleotide-directed site-specific mutagenesis as taught, for
example, by Adelman et al. (DNA, 2:183, 1983). Nucleotide variants may be naturally
occurring allelic variants, or non-naturally occurring variants. Variant nucleotide
sequences preferably exhibit at least about 70%, more preferably at least about 80% and
most preferably at least about 90% identity (determined as described below) to the
recited sequence.

The antigens provided by the present invention include variants that are
encoded by DNA sequences which are substantially homologous to one or more of the
DNA sequences specifically recited herein. "Substantial homology," as used herein,
refers to DNA sequences that are capable of hybridizing under moderately stringent
conditions. Suitable moderately stringent conditions include prewashing in a solution
of 5X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC,
overnight or, in the event of cross-species homology, at 45°C with 0.5X SSC; followed
by washing twice at 65°C for 20 minutes with each of 2X, 0.5X and 0.2X SSC
containing 0.1% SDS. Such hybridizing DNA sequences are also within the scope of
this invention, as are nucleotide sequences that, due to code degeneracy, encode an 
immunogenic polypeptide that is encoded by a hybridizing DNA sequence. 

Two nucleotide or polypeptide sequences are said to be "identical" if the
sequence of nucleotides or amino acid residues in the two sequences is the same when
aligned for maximum correspondence as described below. Comparisons between two
sequences are typically performed by comparing the sequences over a comparison
window to identify and compare local regions of sequence similarity. A "comparison
window" as used herein, refers to a segment of at least about 20 contiguous positions,
usually 30 to about 75, more preferably 40 to about 50, in which a sequence may be
compared to a reference sequence of the same number of contiguous positions after the
two sequences are optimally aligned.

Optimal alignment of sequences for comparison may be conducted using
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR,
Inc., Madison, WI), using default parameters. This program embodies several
alignment schemes described in the following references: Dayhoff, M.O. (1978) A
model of evolutionary change in proteins - Matrices for detecting distant relationships.
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990)
Unified Approach to Alignment and Phylogenies pp. 626-645 Methods in Enzymology
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989)
Fast and sensitive multiple sequence alignments on a microcomputer CABIOS 5:151-153;
Myers, E.W. and Miller W. (1988) Optimal alignments in linear space CABIOS
4:11-17; Robinson, E.D. (1971) Comb. Theor. 11:105; Saitou, N. and Nei, M. (1987) The
neighbor joining method. A new method for reconstructing phylogenetic trees Mol.
Biol. Evol. 4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy -
the Principles and Practice of Numerical Taxonomy, Freeman Press, San Francisco,
CA; Wilbur, W.J. and Lipman, D.J. (1983) Rapid similarity searches of nucleic acid and
protein data banks Proc. Natl. Acad. Sci. USA 80:726-730.

Preferably, the "percentage of sequence identity" is determined by
comparing two optimally aligned sequences over a window of comparison of at least 20
positions, wherein the portion of the polynucleotide sequence in the comparison
window may comprise additions or deletions (i.e. gaps) of 20 percent or less, usually 5
to 15 percent, or 10 to 12 percent, as compared to the reference sequence (which does
not comprise additions or deletions) for optimal alignment of the two sequences. The
percentage is calculated by determining the number of positions at which the identical
nucleic acid base or amino acid residue occurs in both sequences to yield the number
of matched positions, dividing the number of matched positions by the total number of
positions in the reference sequence (i.e. the window size) and multiplying the result by
100 to yield the percentage of sequence identity.
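
The calculation just described can be sketched in a few lines. The example below is a minimal illustration rather than the Megalign implementation: it assumes the two sequences have already been optimally aligned to equal length, that gaps in the compared sequence are written as `-`, and that the reference window itself contains no gaps. The function name `percent_identity` is hypothetical.

```python
def percent_identity(aligned_query: str, aligned_reference: str) -> float:
    """Percentage identity of two pre-aligned, equal-length sequences.

    Matched positions are divided by the window size, i.e. the number of
    positions in the reference window, and multiplied by 100.
    """
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("sequences must be aligned to the same length")
    window = len(aligned_reference)
    if window < 20:
        raise ValueError("comparison window should span at least 20 positions")
    # A gap character never counts as a matched position.
    matches = sum(
        1
        for q, r in zip(aligned_query, aligned_reference)
        if q == r and q != "-"
    )
    return 100.0 * matches / window
```

For a 20-position window in which 19 positions carry the identical base, the function returns 95.0, whether the remaining position is a mismatch or a gap.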

Also included in the scope of the present invention are alleles of the genes
encoding the nucleotide sequences recited herein. As used herein, an "allele" or
"allelic sequence" is an alternative form of the gene which may result from at least one
mutation in the nucleic acid sequence. Alleles may result in altered mRNAs or
polypeptides whose structure or function may or may not be altered. Any given gene
may have none, one, or many allelic forms. Common mutational changes which give
rise to alleles are generally ascribed to natural deletions, additions, or substitutions of
nucleotides. Each of these types of changes may occur alone or in combination with the
others, one or more times in a given sequence.
For breast tumor polypeptides with immunoreactive properties, variants may,
alternatively, be identified by modifying the amino acid sequence of one of the above
polypeptides, and evaluating the immunoreactivity of the modified polypeptide. For
breast tumor polypeptides useful for the generation of diagnostic binding agents, a
variant may be identified by evaluating a modified polypeptide for the ability to
generate antibodies that detect the presence or absence of breast cancer. Such modified
sequences may be prepared and tested using, for example, the representative procedures
described herein.

The breast tumor proteins of the present invention, and polynucleotide
molecules encoding such proteins, may be isolated from breast tumor tissue using any
of a variety of methods well known in the art. Polynucleotide sequences corresponding
to a gene (or a portion thereof) encoding one of the inventive breast tumor proteins may
be isolated from a breast tumor cDNA library using a subtraction technique as described
in detail below. Examples of such DNA sequences are provided in SEQ ID NOS: 1-94.
Partial polynucleotide sequences thus obtained may be used to design oligonucleotide
primers for the amplification of full-length polynucleotide sequences in a polymerase
chain reaction (PCR), using techniques well known in the art (see, for example, Mullis
et al., Cold Spring Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR
Technology, Stockton Press, NY, 1989). Once a polynucleotide sequence encoding a
polypeptide is obtained, any of the above modifications may be readily introduced using
standard mutagenesis techniques, such as oligonucleotide-directed site-specific
mutagenesis as taught, for example, by Adelman et al. (DNA, 2:183, 1983).

The breast tumor polypeptides disclosed herein may also be generated by
synthetic or recombinant means. Synthetic polypeptides having fewer than about 100
amino acids, and generally fewer than about 50 amino acids, may be generated using
techniques well known to those of ordinary skill in the art. For example, such
polypeptides may be synthesized using any of the commercially available solid-phase
techniques, such as the Merrifield solid-phase synthesis method, where amino acids are
sequentially added to a growing amino acid chain (see, for example, Merrifield, J. Am.
Chem. Soc. 85:2149-2154, 1963). Equipment for automated synthesis of polypeptides
is commercially available from suppliers such as Perkin Elmer/Applied BioSystems
Division (Foster City, CA), and may be operated according to the
manufacturer's instructions.

Alternatively, any of the above polypeptides may be produced
recombinantly by inserting a polynucleotide sequence that encodes the polypeptide into
an expression vector and expressing the protein in an appropriate host. Any of a variety
of expression vectors known to those of ordinary skill in the art may be employed to
express recombinant polypeptides of this invention. Expression may be achieved in any
appropriate host cell that has been transformed or transfected with an expression vector
containing a polynucleotide molecule that encodes a recombinant polypeptide.
Suitable host cells include prokaryotes, yeast and higher eukaryotic cells. Preferably,
the host cells employed are E. coli, yeast or a mammalian cell line, such as CHO cells.
The polynucleotide sequences expressed in this manner may encode naturally occurring
polypeptides, portions of naturally occurring polypeptides, or other variants thereof.

In general, regardless of the method of preparation, the polypeptides
disclosed herein are prepared in an isolated, substantially pure form (i.e., the
polypeptides are homogeneous as determined by amino acid composition and primary
sequence analysis). Preferably, the polypeptides are at least about 90% pure, more
preferably at least about 95% pure and most preferably at least about 99% pure. In
certain preferred embodiments, described in more detail below, the substantially pure
polypeptides are incorporated into pharmaceutical compositions or vaccines for use in
one or more of the methods disclosed herein.

In a related aspect, the present invention provides fusion proteins
comprising a first and a second inventive polypeptide or, alternatively, a polypeptide of
the present invention and a known breast tumor antigen, together with variants of such
fusion proteins.

A polynucleotide sequence encoding a fusion protein of the present
invention is constructed using known recombinant DNA techniques to assemble
separate polynucleotide sequences encoding the first and second polypeptides into an
appropriate expression vector. The 3' end of a polynucleotide sequence encoding the
first polypeptide is ligated, with or without a peptide linker, to the 5' end of a
polynucleotide sequence encoding the second polypeptide so that the reading frames of
the sequences are in phase to permit mRNA translation of the two DNA sequences into
a single fusion protein that retains the biological activity of both the first and the
second polypeptides.
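
The in-phase requirement can be illustrated with a short sketch. It is an assumption-laden example, not the construction procedure itself: it treats each coding sequence as a plain DNA string, requires every segment to be a whole number of codons, and removes a terminal stop codon from the first sequence so that translation can continue into the second. The function name `fuse_in_frame` and the toy sequences in the usage note are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(first_cds: str, second_cds: str, linker: str = "") -> str:
    """Join two coding sequences 5'->3' so the second stays in the first's frame."""
    first, second, linker = first_cds.upper(), second_cds.upper(), linker.upper()
    # Every segment must be a whole number of codons to keep the frame in phase.
    for name, seq in (("first", first), ("second", second), ("linker", linker)):
        if len(seq) % 3 != 0:
            raise ValueError(f"{name} sequence length is not a multiple of 3")
    # Drop a terminal stop codon on the first sequence so translation continues.
    if first[-3:] in STOP_CODONS:
        first = first[:-3]
    # No in-frame stop codon may remain upstream of the second sequence.
    upstream = first + linker
    for i in range(0, len(upstream), 3):
        if upstream[i:i + 3] in STOP_CODONS:
            raise ValueError("in-frame stop codon upstream of second polypeptide")
    return upstream + second
```

For example, `fuse_in_frame("ATGGCCTAA", "ATGAAATGA", "GGCTCC")` strips the `TAA` stop from the first sequence and yields one open reading frame that ends at the stop codon of the second sequence.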

A peptide linker sequence may be employed to separate the first and the
second polypeptides by a distance sufficient to ensure that each polypeptide folds into
its secondary and tertiary structures. Such a peptide linker sequence is incorporated into
the fusion protein using standard techniques well known in the art. Suitable peptide
linker sequences may be chosen based on the following factors: (1) their ability to
adopt a flexible extended conformation; (2) their inability to adopt a secondary structure
that could interact with functional epitopes on the first and second polypeptides; and
(3) the lack of hydrophobic or charged residues that might react with the polypeptide
functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser
residues. Other near neutral amino acids, such as Thr and Ala, may also be used in the
linker sequence. Amino acid sequences which may be usefully employed as linkers
include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., Proc.
Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. Patent
No. 4,751,180. The linker sequence may be from 1 to about 50 amino acids in length.
Peptide linker sequences are not required when the first and second polypeptides have
non-essential N-terminal amino acid regions that can be used to separate the functional
domains and prevent steric interference.
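
A rough screen for candidate linkers along the lines of the composition and length guidance above might look like the following. This is only a sketch under stated assumptions: it uses one-letter residue codes, treats "about 50" as a hard upper limit of 50, and checks composition only, not conformation or interaction with epitopes. The name `check_linker` is hypothetical.

```python
PREFERRED = set("GNS")     # Gly, Asn, Ser
NEAR_NEUTRAL = set("TA")   # Thr, Ala
MAX_LINKER_LENGTH = 50     # "about 50" in the text, fixed at 50 for this sketch


def check_linker(linker: str) -> bool:
    """Return True if the linker is 1-50 residues of Gly/Asn/Ser/Thr/Ala only."""
    seq = linker.upper()
    allowed = PREFERRED | NEAR_NEUTRAL
    return 1 <= len(seq) <= MAX_LINKER_LENGTH and all(res in allowed for res in seq)
```

A sequence such as `GGGGSGGGGS` passes this screen, whereas one containing a charged residue such as Lys (`K`) does not.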

The ligated polynucleotide sequences are operably linked to suitable
transcriptional or translational regulatory elements. The regulatory elements
responsible for expression of polynucleotides are located only 5' to the polynucleotide
sequence encoding the first polypeptide. Similarly, stop codons required to end
translation and transcription termination signals are only present 3' to the polynucleotide
sequence encoding the second polypeptide.

Fusion proteins are also provided that comprise a polypeptide of the
present invention together with an unrelated immunogenic protein. Preferably the
immunogenic protein is capable of eliciting a recall response. Examples of such
proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, Stoute et
al., New Engl. J. Med. 336:86-91 (1997)).

Polypeptides of the present invention that comprise an immunogenic
portion of a breast tumor protein may generally be used for immunotherapy of breast
cancer, wherein the polypeptide stimulates the patient's own immune response to breast
tumor cells. In further aspects, the present invention provides methods for using one or
more of the immunoreactive polypeptides encoded by a polynucleotide molecule
having a sequence provided in SEQ ID NOS: 1-94 (or fusion proteins comprising one
or more such polypeptides and/or polynucleotides encoding such polypeptides) for
immunotherapy of breast cancer in a patient. As used herein, a "patient" refers to any
warm-blooded animal, preferably a human. A patient may be afflicted with a disease,
or may be free of detectable disease. Accordingly, the above immunoreactive
polypeptides (or fusion proteins or polynucleotide molecules encoding such
polypeptides) may be used to treat breast cancer or to inhibit the development of breast
cancer. The polypeptides may be administered either prior to or following surgical
removal of primary tumors and/or treatment by administration of radiotherapy and
conventional chemotherapeutic drugs.

In these aspects, the polypeptide or fusion protein is generally present
within a pharmaceutical composition and/or a vaccine. Pharmaceutical compositions
may comprise one or more polypeptides, each of which may contain one or more of the
above sequences (or variants thereof), and a physiologically acceptable carrier. The
vaccines may comprise one or more of such polypeptides and a non-specific immune
response enhancer, wherein the non-specific immune response enhancer is capable of
eliciting or enhancing an immune response to an exogenous antigen. Examples of
non-specific immune response enhancers include adjuvants, biodegradable microspheres
(e.g., polylactic galactide) and liposomes (into which the polypeptide is incorporated).
Pharmaceutical compositions and vaccines may also contain other epitopes of breast
tumor antigens, either incorporated into a combination polypeptide (i.e., a single
polypeptide that contains multiple epitopes) or present within a separate polypeptide.
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Alternatively, a pharmaceutical composition or vaccine may contain 
polynucleotides encoding one or more of the above polypeptides^ such that the 
polypeptide is generated in situ. In such pharmaceutical compositions and vaccines, the 
polynucleotide may be present within any of a variety of delivery systems known to 

5 those of ordinary skill in the art, including nucleic acid expression systems, bacteria and 
viral expression systems. Appropriate nucleic acid expression systems contain the 
necessary polynucleotide sequences for expression in the patient (such as a suitable 
promoter). Bacterial delivery systems involve the administration of a bacterium (such 
as Bacillus-Calmette-Guerriri) that expresses an epitope of a breast tumor cell antigen 

10 on its cell surface. In a preferred embodiment, the polynucleotide molecules may be 
introduced using a viral expression system {e.g., vaccinia or other pox virus, retrovirus, 
or adenovirus), which may involve the use of a non-pathogenic (defective), replication 
competent virus. Suitable systems are disclosed, for example, in Fisher-Hoch et al.,
PNAS 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103, 1989; Flexner
et al., Vaccine 8:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487;
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805;
Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991;
Kolls et al., PNAS 91:215-219, 1994; Kass-Eisler et al., PNAS 90:11498-11502, 1993;
Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res.
73:1202-1207, 1993. Techniques for incorporating polynucleotides into such
expression systems are well known to those of ordinary skill in the art. The 
polynucleotides may also be "naked," as described, for example, in published PCT
application WO 90/11092, and Ulmer et al., Science 259:1745-1749, 1993, reviewed by
Cohen, Science 259:1691-1692, 1993. The uptake of naked polynucleotides may be
increased by coating the polynucleotides onto biodegradable beads, which are
efficiently transported into the cells.

Routes and frequency of administration, as well as dosage, will vary 
from individual to individual and may parallel those currently being used in 
immunotherapy of other diseases. In general, the pharmaceutical compositions and
vaccines may be administered by injection (e.g., intracutaneous, intramuscular,
intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 and
10 doses may be administered over a 3-24 week period. Preferably, 4 doses are 
administered, at an interval of 3 months, and booster administrations may be given 
periodically thereafter. Alternate protocols may be appropriate for individual patients.
A suitable dose is an amount of polypeptide or polynucleotide molecule that is effective
to raise an immune response (cellular and/or humoral) against breast tumor cells in a 
treated patient. A suitable immune response is at least 10-50% above the basal (i.e., 
untreated) level. In general, the amount of polypeptide present in a dose (or produced in 
situ by the polynucleotide in a dose) ranges from about 1 pg to about 100 mg per kg of
host, typically from about 10 pg to about 1 mg, and preferably from about 100 pg to
about 1 µg. Suitable dose sizes will vary with the size of the patient, but will typically
range from about 0.01 mL to about 5 mL. 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier
will vary depending on the mode of administration. For parenteral administration, such
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a 
lipid, a wax and/or a buffer. For oral administration, any of the above carriers or a solid 
carrier, such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, 
talcum, cellulose, glucose, sucrose, and/or magnesium carbonate, may be employed.
Biodegradable microspheres (e.g., polylactic glycolide) may also be employed as
carriers for the pharmaceutical compositions of this invention. Suitable biodegradable
microspheres are disclosed, for example, in U.S. Patent Nos. 4,897,268 and 5,075,109.

Any of a variety of non-specific immune response enhancers may be 
employed in the vaccines of this invention. For example, an adjuvant may be included.
Most adjuvants contain a substance designed to protect the antigen from rapid
catabolism, such as aluminum hydroxide or mineral oil, and a nonspecific stimulator of
immune response, such as lipid A, Bordetella pertussis or Mycobacterium tuberculosis.
Such adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI) and Merck
Adjuvant 65 (Merck and Company, Inc., Rahway, NJ).

Polypeptides disclosed herein may also be employed in adoptive
immunotherapy for the treatment of cancer. Adoptive immunotherapy may be broadly 
classified into either active or passive immunotherapy. In active immunotherapy, 
treatment relies on the in vivo stimulation of the endogenous host immune system to
react against tumors with the administration of immune response-modifying agents (for
example, tumor vaccines, bacterial adjuvants, and/or cytokines). 

In passive immunotherapy, treatment involves the delivery of biologic
reagents with established tumor-immune reactivity (such as effector cells or antibodies) 
that can directly or indirectly mediate antitumor effects and does not necessarily depend
on an intact host immune system. Examples of effector cells include T lymphocytes
(for example, CD8+ cytotoxic T-lymphocyte, CD4+ T-helper, tumor-infiltrating 
lymphocytes), killer cells (such as Natural Killer cells, lymphokine-activated killer 
cells), B cells, or antigen presenting cells (such as dendritic cells and macrophages) 
expressing the disclosed antigens. The polypeptides disclosed herein may also be used
to generate antibodies or anti-idiotypic antibodies (as in U.S. Patent No. 4,918,164), for
passive immunotherapy. 

The predominant method of procuring adequate numbers of T-cells for 
adoptive immunotherapy is to grow immune T-cells in vitro. Culture conditions for 
expanding single antigen-specific T-cells to several billion in number with retention of
antigen recognition in vivo are well known in the art. These in vitro culture conditions
typically utilize intermittent stimulation with antigen, often in the presence of cytokines, 
such as IL-2, and non-dividing feeder cells. As noted above, the immunoreactive 
polypeptides described herein may be used to rapidly expand antigen-specific T cell 
cultures in order to generate a sufficient number of cells for immunotherapy. In
particular, antigen-presenting cells, such as dendritic, macrophage or B-cells, may be
pulsed with immunoreactive polypeptides or transfected with a polynucleotide 
sequence(s), using standard techniques well known in the art. For example, antigen 
presenting cells may be transfected with a polynucleotide sequence, wherein said 
sequence contains a promoter region appropriate for increasing expression, and can be
expressed as part of a recombinant virus or other expression system. For cultured
T-cells to be effective in therapy, the cultured T-cells must be able to grow and distribute
widely and to survive long term in vivo. Studies have demonstrated that cultured T- 
cells can be induced to grow in vivo and to survive long term in substantial numbers by 
repeated stimulation with antigen supplemented with IL-2 (see, for example, Cheever,
M., et al., "Therapy With Cultured T Cells: Principles Revisited," Immunological
Reviews, 157:111, 1997).

The polypeptides disclosed herein may also be employed to generate 
and/or isolate tumor-reactive T-cells, which can then be administered to the patient. In 
one technique, antigen-specific T-cell lines may be generated by in vivo immunization
with short peptides corresponding to immunogenic portions of the disclosed
polypeptides. The resulting antigen specific CD8+ CTL clones may be isolated from 
the patient, expanded using standard tissue culture techniques, and returned to the 
patient. 

Alternatively, peptides corresponding to immunogenic portions of the
polypeptides may be employed to generate tumor reactive T cell subsets by selective in
vitro stimulation and expansion of autologous T cells to provide antigen-specific T cells
which may be subsequently transferred to the patient as described, for example, by
Chang et al. (Crit. Rev. Oncol. Hematol., 22(3), 213, 1996). Cells of the immune
system, such as T cells, may be isolated from the peripheral blood of a patient, using a
commercially available cell separation system, such as CellPro Incorporated's (Bothell,
WA) CEPRATE™ system (see U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; 
WO 89/06280; WO 91/16116 and WO 92/07243). The separated cells are stimulated 
with one or more of the immunoreactive polypeptides contained within a delivery 
vehicle, such as a microsphere, to provide antigen-specific T cells. The population of
tumor antigen-specific T cells is then expanded using standard techniques and the cells
are administered back to the patient. 

In another embodiment, T-cell and/or antibody receptors specific for the 
polypeptides can be cloned, expanded, and transferred into other vectors or effector 
cells for use in adoptive immunotherapy. 

In a further embodiment, syngeneic or autologous dendritic cells may be
pulsed with peptides corresponding to at least an immunogenic portion of a polypeptide
disclosed herein. The resulting antigen-specific dendritic cells may either be transferred 
into a patient, or employed to stimulate T cells to provide antigen-specific T cells which 
may, in turn, be administered to a patient. The use of peptide-pulsed dendritic cells to
generate antigen-specific T cells and the subsequent use of such antigen-specific T cells
to eradicate tumors in a murine model has been demonstrated by Cheever et al.
(Immunological Reviews, 157:111, 1997).

Additionally, vectors expressing the disclosed polynucleotides may be 
introduced into stem cells taken from the patient and clonally propagated in vitro for
autologous transplant back into the same patient.

Polypeptides of the present invention may also, or alternatively, be used 
to generate binding agents, such as antibodies or fragments thereof, that are capable of 
detecting metastatic human breast tumors. Binding agents of the present invention may 
generally be prepared using methods known to those of ordinary skill in the art,
including the representative procedures described herein. Binding agents are capable of
differentiating between patients with and without breast cancer, using the representative 
assays described herein. In other words, antibodies or other binding agents raised 
against a breast tumor protein, or a suitable portion thereof, will generate a signal 
indicating the presence of primary or metastatic breast cancer in at least about 20% of
patients afflicted with the disease, and will generate a negative signal indicating the
absence of the disease in at least about 90% of individuals without primary or metastatic 
breast cancer. Suitable portions of such breast tumor proteins are portions that are able 
to generate a binding agent that indicates the presence of primary or metastatic breast 
cancer in substantially all (i.e., at least about 80%, and preferably at least about 90%) of
the patients for which breast cancer would be indicated using the full length protein, and
that indicate the absence of breast cancer in substantially all of those samples that would 
be negative when tested with full length protein. The representative assays described 
below, such as the two-antibody sandwich assay, may generally be employed for
evaluating the ability of a binding agent to detect metastatic human breast tumors. 
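
The two performance criteria stated above (a positive signal in at least about 20% of afflicted patients, and a negative signal in at least about 90% of individuals without the disease) can be checked with a short calculation. The following Python sketch is illustrative only: the function names and the sample data are hypothetical and are not part of the disclosure.

```python
def sensitivity_specificity(results_with_disease, results_without_disease):
    """Return (sensitivity, specificity) from boolean assay results.

    Each list holds one entry per individual: True for a positive
    signal, False for a negative signal.
    """
    true_positives = sum(1 for r in results_with_disease if r)
    true_negatives = sum(1 for r in results_without_disease if not r)
    sensitivity = true_positives / len(results_with_disease)
    specificity = true_negatives / len(results_without_disease)
    return sensitivity, specificity


def meets_binding_agent_criteria(sensitivity, specificity,
                                 min_sensitivity=0.20, min_specificity=0.90):
    """Apply the 'at least about 20%' and 'at least about 90%' criteria."""
    return sensitivity >= min_sensitivity and specificity >= min_specificity


# Hypothetical panel: 10 afflicted patients, 20 unaffected individuals.
afflicted = [True] * 3 + [False] * 7
unaffected = [False] * 19 + [True]
sens, spec = sensitivity_specificity(afflicted, unaffected)
acceptable = meets_binding_agent_criteria(sens, spec)
```

The default thresholds are taken from the text; because the text says "about", a practitioner may choose to treat them as approximate rather than strict limits.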

The ability of a polypeptide prepared as described herein to generate
antibodies capable of detecting primary or metastatic human breast tumors may 
generally be evaluated by raising one or more antibodies against the polypeptide (using, 
for example, a representative method described herein) and determining the ability of 
such antibodies to detect such tumors in patients. This determination may be made by
assaying biological samples from patients with and without primary or metastatic breast 
cancer for the presence of a polypeptide that binds to the generated antibodies. Such 
test assays may be performed, for example, using a representative procedure described 
below. Polypeptides that generate antibodies capable of detecting at least 20% of 
primary or metastatic breast tumors by such procedures are considered to be useful in
assays for detecting primary or metastatic human breast tumors. Polypeptide specific 
antibodies may be used alone or in combination to improve sensitivity. 

Polypeptides capable of detecting primary or metastatic human breast 
tumors may be used as markers for diagnosing breast cancer or for monitoring disease 
progression in patients. In one embodiment, breast cancer in a patient may be
diagnosed by evaluating a biological sample obtained from the patient for the level of 
one or more of the above polypeptides, relative to a predetermined cut-off value. As 
used herein, suitable "biological samples" include blood, sera and urine. 

The level of one or more of the above polypeptides may be evaluated 
using any binding agent specific for the polypeptide(s). A "binding agent," in the
context of this invention, is any agent (such as a compound or a cell) that binds to a 
polypeptide as described above. As used herein, "binding" refers to a noncovalent 
association between two separate molecules (each of which may be free (i.e., in 
solution) or present on the surface of a cell or a solid support), such that a "complex" is 
formed. Such a complex may be free or immobilized (either covalently or
noncovalently) on a support material. The ability to bind may generally be evaluated by 
determining a binding constant for the formation of the complex. The binding constant 
is the value obtained when the concentration of the complex is divided by the product of 
the component concentrations. In general, two compounds are said to "bind" in the 
context of the present invention when the binding constant for complex formation
exceeds about 10^ L/mol. The binding constant may be determined using methods well
known to those of ordinary skill in the art. 
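
The definition of the binding constant given above (complex concentration divided by the product of the component concentrations) can be written out as a short calculation. The Python sketch below is illustrative only. It assumes equilibrium concentrations expressed in mol/L, so that the result is in L/mol; the function names and example values are hypothetical, and the threshold is left as a caller-supplied parameter.

```python
def binding_constant(complex_conc, component_a_conc, component_b_conc):
    """Binding constant K = [complex] / ([A] * [B]).

    All concentrations are assumed to be equilibrium values in mol/L,
    which gives K in L/mol.
    """
    if component_a_conc <= 0 or component_b_conc <= 0:
        raise ValueError("component concentrations must be positive")
    return complex_conc / (component_a_conc * component_b_conc)


def is_binding(k_l_per_mol, threshold_l_per_mol):
    """Two compounds 'bind' when K exceeds the chosen threshold."""
    return k_l_per_mol > threshold_l_per_mol


# Hypothetical equilibrium concentrations (mol/L).
k = binding_constant(complex_conc=1e-7,
                     component_a_conc=1e-6,
                     component_b_conc=1e-6)
```

With these example values K is approximately 1e5 L/mol, which would then be compared against whatever threshold the practitioner adopts.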

Any agent that satisfies the above requirements may be a binding agent. 
For example, a binding agent may be a ribosome with or without a peptide component,
an RNA molecule or a peptide. In a preferred embodiment, the binding partner is an
antibody, or a fragment thereof. Such antibodies may be polyclonal, or monoclonal. In 
addition, the antibodies may be single chain, chimeric, CDR-grafted or humanized. 
Antibodies may be prepared by the methods described herein and by other methods well 
known to those of skill in the art. 

There are a variety of assay formats known to those of ordinary skill in
the art for using a binding partner to detect polypeptide markers in a sample. See, e.g.,
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In a preferred embodiment, the assay involves the use of a binding partner
immobilized on a solid support to bind to and remove the polypeptide from the
remainder of the sample. The bound polypeptide may then be detected using a second
binding partner that contains a reporter group. Suitable second binding partners include 
antibodies that bind to the binding partner/polypeptide complex. Alternatively, a 
competitive assay may be utilized, in which a polypeptide is labeled with a reporter 
group and allowed to bind to the immobilized binding partner after incubation of the
binding partner with the sample. The extent to which components of the sample inhibit
the binding of the labeled polypeptide to the binding partner is indicative of the 
reactivity of the sample with the immobilized binding partner. 

The solid support may be any material known to those of ordinary skill 
in the art to which the antigen may be attached. For example, the solid support may be
a test well in a microtiter plate or a nitrocellulose or other suitable membrane.
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support
using a variety of techniques known to those of skill in the art, which are amply
described in the patent and scientific literature. In the context of the present invention,
the term "immobilization" refers to both noncovalent association, such as adsorption,
and covalent attachment (which may be a direct linkage between the antigen and
functional groups on the support or may be a linkage by way of a cross-linking agent).
Immobilization by adsorption to a well in a microtiter plate or to a membrane is
preferred. In such cases, adsorption may be achieved by contacting the binding agent, 
in a suitable buffer, with the solid support for a suitable amount of time. The contact 
time varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or
polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about
10 µg, and preferably about 100 ng to about 1 µg, is sufficient to immobilize an
adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with
both the support and a functional group, such as a hydroxyl or amino group, on the
binding agent. For example, the binding agent may be covalently attached to supports
having an appropriate polymer coating using benzoquinone or by condensation of an
aldehyde group on the support with an amine and an active hydrogen on the binding
partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at
A12-A13).

In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 
Unbound sample is then removed from the immobilized polypeptide-antibody
complexes and a second antibody (containing a reporter group) capable of binding to a 
different site on the polypeptide is added. The amount of second antibody that remains 
bound to the solid support is then determined using a method appropriate for the 
specific reporter group. 

More specifically, once the antibody is immobilized on the support as
described above, the remaining protein binding sites on the support are typically
blocked. Any suitable blocking agent known to those of ordinary skill in the art may be
used, such as bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The
immobilized antibody is then incubated with the sample, and polypeptide is allowed to
bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (i.e., incubation time) is that period of time that is sufficient to detect the presence 
of polypeptide within a sample obtained from an individual with breast cancer.
Preferably, the contact time is sufficient to achieve a level of binding that is at least
about 95% of that achieved at equilibrium between bound and unbound polypeptide. 
Those of ordinary skill in the art will recognize that the time necessary to achieve 
equilibrium may be readily determined by assaying the level of binding that occurs over 
a period of time. At room temperature, an incubation time of about 30 minutes is
generally sufficient.
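
The determination of a contact time from a binding time course, as described above, can be sketched as follows. This Python example is illustrative only. It assumes that the final measurement of the time course approximates the equilibrium level of binding, which holds only if the time course was run long enough to reach a plateau. The function name and the data are hypothetical.

```python
def minimal_contact_time(times_min, signals, fraction=0.95):
    """Return the earliest sampled time at which binding reaches
    `fraction` of the plateau level.

    Assumption: the last signal in the series approximates the
    equilibrium (plateau) level of binding.
    """
    if len(times_min) != len(signals) or not signals:
        raise ValueError("times and signals must be non-empty and equal length")
    plateau = signals[-1]
    for t, s in zip(times_min, signals):
        if s >= fraction * plateau:
            return t
    return None


# Hypothetical time course: minutes of incubation vs. normalized signal.
times = [5, 10, 20, 30, 60, 120]
binding = [0.30, 0.52, 0.80, 0.96, 0.99, 1.00]
contact_time = minimal_contact_time(times, binding)
```

Only sampled time points can be returned, so a denser time course gives a finer estimate.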

Unbound sample may then be removed by washing the solid support 
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include enzymes (such as horseradish peroxidase), substrates,
cofactors, inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and
biotin. The conjugation of antibody to reporter group may be achieved using standard 
methods known to those of ordinary skill in the art. 

The second antibody is then incubated with the immobilized
antibody-polypeptide complex for an amount of time sufficient to detect the bound polypeptide.
An appropriate amount of time may generally be determined by assaying the level of
binding that occurs over a period of time. Unbound second antibody is then removed 
and bound second antibody is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups
and fluorescent groups. Biotin may be detected using avidin, coupled to a different
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the
reaction products.

To determine the presence or absence of breast cancer, the signal 
detected from the reporter group that remains bound to the solid support is generally 
compared to a signal that corresponds to a predetermined cut-off value. In one 
preferred embodiment, the cut-off value is the average mean signal obtained when the
immobilized antibody is incubated with samples from patients without breast cancer. In
general, a sample generating a signal that is three standard deviations above the 
predetermined cut-off value is considered positive for breast cancer. In an alternate 
preferred embodiment, the cut-off value is determined using a Receiver Operator Curve, 
according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for
Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment,
the cut-off value may be determined from a plot of pairs of true positive rates (i.e.,
sensitivity) and false positive rates (100%-specificity) that correspond to each possible
cut-off value for the diagnostic test result. The cut-off value on the plot that is the
closest to the upper left-hand corner (i.e., the value that encloses the largest area) is the
most accurate cut-off value, and a sample generating a signal that is higher than the
cut-off value determined by this method may be considered positive. Alternatively, the
cut-off value may be shifted to the left along the plot, to minimize the false positive rate, or
to the right, to minimize the false negative rate. In general, a sample generating a signal 
that is higher than the cut-off value determined by this method is considered positive for
breast cancer.
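
Both approaches to the cut-off value described above can be illustrated with a short calculation. The Python sketch below is a minimal example under stated assumptions and is not the method of Sackett et al. itself. It assumes that candidate cut-offs are the observed signal values, that a sample is positive when its signal is strictly higher than the cut-off, and that "closest to the upper left-hand corner" means the smallest Euclidean distance to the point (false positive rate 0, true positive rate 1). All names and data are hypothetical.

```python
import math
import statistics


def mean_plus_three_sd(control_signals):
    """Return (mean control signal, mean + 3 sample standard deviations)."""
    mean = statistics.mean(control_signals)
    sd = statistics.stdev(control_signals)
    return mean, mean + 3 * sd


def roc_cutoff(positive_signals, negative_signals):
    """Return (cutoff, true_positive_rate, false_positive_rate) for the
    candidate cut-off closest to the upper left-hand corner of the plot."""
    best = None
    for c in sorted(set(positive_signals) | set(negative_signals)):
        tpr = sum(s > c for s in positive_signals) / len(positive_signals)
        fpr = sum(s > c for s in negative_signals) / len(negative_signals)
        distance = math.hypot(fpr, 1.0 - tpr)
        if best is None or distance < best[0]:
            best = (distance, c, tpr, fpr)
    return best[1], best[2], best[3]


# Hypothetical signals from individuals without breast cancer.
controls = [1.0, 1.2, 0.8, 1.1, 0.9]
control_mean, positive_threshold = mean_plus_three_sd(controls)

# Hypothetical signals for the ROC-based approach.
with_cancer = [0.9, 1.2, 1.5, 2.0]
without_cancer = [0.2, 0.3, 0.4, 0.5, 1.0]
cutoff, tpr, fpr = roc_cutoff(with_cancer, without_cancer)
```

Shifting the returned cut-off upward trades sensitivity for a lower false positive rate, and shifting it downward does the reverse, which corresponds to moving left or right along the plot.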

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the antibody is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized antibody as the sample passes through the membrane. A second, labeled 
antibody then binds to the antibody-polypeptide complex as a solution containing the
second antibody flows through the membrane. The detection of bound second antibody
may then be performed as described above. In the strip test format, one end of the 
membrane to which antibody is bound is immersed in a solution containing the sample. 
The sample migrates along the membrane through a region containing second antibody 
and to the area of immobilized antibody. Concentration of second antibody at the area 
of immobilized antibody indicates the presence of breast cancer. Typically, the 
concentration of second antibody at that site generates a pattern, such as a line, that can 
be read visually. The absence of such a pattern indicates a negative result. In general, 
the amount of antibody immobilized on the membrane is selected to generate a visually 
discernible pattern when the biological sample contains a level of polypeptide that 
would be sufficient to generate a positive signal in the two-antibody sandwich assay, in 
the format discussed above. Preferably, the amount of antibody immobilized on the 
membrane ranges from about 25 ng to about 1 µg, and more preferably from about
50 ng to about 500 ng. Such tests can typically be performed with a very small amount 
of biological sample. 

Of course, numerous other assay protocols exist that are suitable for use 
with the antigens or antibodies of the present invention. The above descriptions are 
intended to be exemplary only. 

In another embodiment, the above polypeptides may be used as markers 
for the progression of breast cancer. In this embodiment, assays as described above for 
the diagnosis of breast cancer may be performed over time, and the change in the level 
of reactive polypeptide(s) evaluated. For example, the assays may be performed every 
24-72 hours for a period of 6 months to 1 year, and thereafter performed as needed. In 
general, breast cancer is progressing in those patients in whom the level of polypeptide 
detected by the binding agent increases over time. In contrast, breast cancer is not 
progressing when the level of reactive polypeptide either remains constant or decreases 
with time. 
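
The comparison of polypeptide levels over time described above can be sketched with a simple trend calculation. The Python example below is illustrative only. It assumes that an increase over time is judged by the sign of a least-squares slope fitted to the serial measurements, which is one of several reasonable choices and is not specified in the text. The function names, the tolerance parameter and the data are hypothetical.

```python
def trend_slope(times, levels):
    """Least-squares slope of polypeptide level versus time."""
    if len(times) != len(levels) or len(times) < 2:
        raise ValueError("need at least two paired measurements")
    n = len(times)
    mean_t = sum(times) / n
    mean_l = sum(levels) / n
    numerator = sum((t - mean_t) * (l - mean_l) for t, l in zip(times, levels))
    denominator = sum((t - mean_t) ** 2 for t in times)
    if denominator == 0:
        raise ValueError("measurement times must not all be identical")
    return numerator / denominator


def is_progressing(times, levels, min_slope=0.0):
    """True when the level increases over time; a constant or
    decreasing level is treated as not progressing."""
    return trend_slope(times, levels) > min_slope


# Hypothetical serial measurements (time in days, level in arbitrary units).
days = [0, 2, 4, 6]
rising = is_progressing(days, [1.0, 1.3, 1.7, 2.1])
falling = is_progressing(days, [2.0, 2.0, 1.9, 1.8])
flat = is_progressing(days, [1.0, 1.0, 1.0, 1.0])
```

A non-zero `min_slope` can be supplied so that small fluctuations within assay noise are not read as progression.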

Antibodies for use in the above methods may be prepared by any of a 
variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one 
such technique, an immunogen comprising the antigenic polypeptide is initially injected
into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep and goats). In 
this step, the polypeptides of this invention may serve as the immunogen without 
modification. Alternatively, particularly for relatively short polypeptides, a superior
immune response may be elicited if the polypeptide is joined to a carrier protein, such
as bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected 
into the animal host, preferably according to a predetermined schedule incorporating 
one or more booster immunizations, and the animals are bled periodically. Polyclonal 
antibodies specific for the polypeptide may then be purified from such antisera by, for
example, affinity chromatography using the polypeptide coupled to a suitable solid
support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J.
Immunol. 6:511-519, 1976, and improvements thereto. Briefly, these methods involve
the preparation of immortal cell lines capable of producing antibodies having the
desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized
animal. A variety of fusion techniques may be employed. For example, the spleen cells
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks,
colonies of hybrids are observed. Single colonies are selected and tested for binding
activity against the polypeptide. Hybridomas having high reactivity and specificity
are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing
hybridoma colonies. In addition, various techniques may be employed to enhance the
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

Monoclonal antibodies of the present invention may also be used as 
therapeutic reagents, to diminish or eliminate breast tumors. The antibodies may be 
used on their own (for instance, to inhibit metastases) or coupled to one or more 
therapeutic agents. Suitable agents in this regard include radionuclides, differentiation 
inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides include ⁹⁰Y,
¹²³I, ¹²⁵I, ¹³¹I, ¹⁸⁶Re, ¹⁸⁸Re, ²¹¹At, and ²¹²Bi. Preferred drugs include methotrexate, and
pyrimidine and purine analogs. Preferred differentiation inducers include phorbol esters 
and butyric acid. Preferred toxins include ricin, abrin, diphtheria toxin, cholera toxin,
gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a
carbonyl-containing group, such as an anhydride or an acid halide, or with an alkyl group
containing a good leaving group (e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an 
antibody from an agent in order to avoid interference with binding capabilities. A 
linker group can also serve to increase the chemical reactivity of a substituent on an 
agent or an antibody, and thus increase the coupling efficiency. An increase in 
chemical reactivity may also facilitate the use of agents, or functional groups on agents, 
which otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional 
or polyfunctional reagents, both homo- and hetero-functional (such as those described 
in the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al.

Where a therapeutic agent is more potent when free from the antibody
portion of the immunoconjugates of the present invention, it may be desirable to use a
linker group which is cleavable during or upon internalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.).

It may be desirable to couple more than one agent to an antibody. In one
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers which provide multiple sites for
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may
also bear an agent by noncovalent bonding or by encapsulation, such as within a
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for
radionuclide agents include radiohalogenated small molecules and chelating
compounds. For example, U.S. Patent No. 4,735,792 discloses representative
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be
formed from chelating compounds that include those containing nitrogen and sulfur
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. 

A variety of routes of administration for the antibodies and 
immunoconjugates may be used. Typically, administration will be intravenous, 
intramuscular, subcutaneous or in the bed of a resected tumor. It will be evident that the 
precise dose of the antibody/immunoconjugate will vary depending upon the antibody 
used, the antigen density on the tumor, and the rate of clearance of the antibody. 

Diagnostic reagents of the present invention may also comprise 
polynucleotide sequences encoding one or more of the above polypeptides, or one or 
more portions thereof. For example, at least two oligonucleotide primers may be 
employed in a polymerase chain reaction (PCR) based assay to amplify breast
tumor-specific cDNA derived from a biological sample, wherein at least one of the
oligonucleotide primers is specific for a DNA molecule encoding a breast tumor protein 
of the present invention. The presence of the amplified cDNA is then detected using 
techniques well known in the art, such as gel electrophoresis. Similarly, oligonucleotide 
probes specific for a DNA molecule encoding a breast tumor protein of the present 
invention may be used in a hybridization assay to detect the presence of an inventive 
polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a 
DNA molecule" means an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to the 
DNA molecule in question. Oligonucleotide primers and/or probes which may be 
usefully employed in the inventive diagnostic methods preferably have at least about 
10-40 nucleotides. In a preferred embodiment, the oligonucleotide primers comprise at 
least about 10 contiguous nucleotides of a DNA molecule having a partial sequence 
selected from SEQ ID NOS: 1-94. Preferably, oligonucleotide probes for use in the
inventive diagnostic methods comprise at least about 15 contiguous nucleotides of
a DNA molecule having a partial sequence provided in SEQ ID NOS: 1-94.
Techniques for both PCR based assays and hybridization assays are well known in the
art (see, for example, Mullis et al., Ibid; Ehrlich, Ibid). Primers or probes may thus be
used to detect breast tumor-specific sequences in biological samples, including blood, 
urine and/or breast tumor tissue. 
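
The identity criterion defined above can be illustrated with a minimal Python sketch. It assumes an ungapped comparison of the oligonucleotide against every window of the target; the function names `percent_identity` and `longest_shared_run` are hypothetical, and the sample target is simply the first 30 nucleotides of SEQ ID NO: 68. The comparison procedure actually used is not specified in this disclosure, so the sketch is one possible reading only.

```python
def percent_identity(oligo: str, target: str) -> float:
    """Best ungapped percent identity of `oligo` against any window of `target`.

    Assumption: identity is scored position by position without gaps.
    """
    oligo, target = oligo.lower(), target.lower()
    n = len(oligo)
    if n == 0 or n > len(target):
        raise ValueError("oligo must be non-empty and no longer than target")
    best = 0
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        matches = sum(a == b for a, b in zip(oligo, window))
        best = max(best, matches)
    return 100.0 * best / n


def longest_shared_run(oligo: str, target: str) -> int:
    """Length of the longest contiguous stretch of `oligo` found verbatim in `target`."""
    oligo, target = oligo.lower(), target.lower()
    best = 0
    for i in range(len(oligo)):
        # Only try substrings longer than the best run found so far.
        for j in range(i + best + 1, len(oligo) + 1):
            if oligo[i:j] in target:
                best = j - i
            else:
                break
    return best


# Sample input: first 30 nucleotides of SEQ ID NO: 68.
target = "ttgtgttggggttcccttttccggtcggcg"
exact_primer = "gttccctttt"      # hypothetical 10-mer copied from the target
mismatch_primer = "gttcccattt"   # same 10-mer with one substituted base

print(percent_identity(exact_primer, target), longest_shared_run(exact_primer, target))
print(percent_identity(mismatch_primer, target), longest_shared_run(mismatch_primer, target))
```

Under these assumptions an oligonucleotide would satisfy the "at least about 60%" threshold when `percent_identity` returns 60 or more, and the preferred primer embodiment when `longest_shared_run` returns 10 or more.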

The following Examples are offered by way of illustration and not by
way of limitation.

EXAMPLES 

Example 1

ISOLATION AND CHARACTERIZATION OF BREAST 
TUMOR POLYPEPTIDES 

This Example describes the isolation of breast tumor polypeptides from a
breast tumor cDNA library.

A human breast tumor cDNA expression library was constructed from a
pool of breast tumor poly A+ RNA from three patients using a Superscript Plasmid
System for cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies,
Gaithersburg, MD 20897) following the manufacturer's protocol. Specifically, breast
tumor tissues were homogenized with a polytron (Kinematica, Switzerland) and total
RNA was extracted using Trizol reagent (BRL Life Technologies) as directed by the
manufacturer. The poly A+ RNA was then purified using a Qiagen oligotex spin
column mRNA purification kit (Qiagen, Santa Clarita, CA 91355) according to the
manufacturer's protocol. First-strand cDNA was synthesized using the NotI/Oligo-dT18
primer. Double-stranded cDNA was synthesized, ligated with EcoRI/BstX I
adaptors (Invitrogen, Carlsbad, CA) and digested with NotI. Following size
fractionation with Chroma Spin-1000 columns (Clontech, Palo Alto, CA 94303), the
cDNA was ligated into the EcoRI/NotI site of pCDNA3.1 (Invitrogen, Carlsbad, CA)
and transformed into ElectroMax E. coli DH10B cells (BRL Life Technologies) by
electroporation.

Using the same procedure, a normal human breast cDNA expression
library was prepared from a pool of four normal breast tissue specimens. The cDNA
libraries were characterized by determining the number of independent colonies, the
percentage of clones that carried insert, the average insert size and by sequence analysis.
The breast tumor library contained 1.14 x 10^ independent colonies, with more than
90% of clones having a visible insert and the average insert size being 936 base pairs.
The normal breast cDNA library contained 6 x 10^ independent colonies, with 83% of
clones having inserts and the average insert size being 1015 base pairs. Sequencing
analysis showed both libraries to contain good complex cDNA clones that were
synthesized from mRNA, with minimal rRNA and mitochondrial DNA contamination.

cDNA library subtraction was performed using the above breast tumor
and normal breast cDNA libraries, as described by Hara et al. (Blood, 84:189-199,
1994) with some modifications. Specifically, a breast tumor-specific subtracted cDNA
library was generated as follows. Normal breast cDNA library (70 µg) was digested
with EcoRI, NotI, and SfuI, followed by a filling-in reaction with DNA polymerase
Klenow fragment. After phenol-chloroform extraction and ethanol precipitation, the
DNA was dissolved in 100 µl of H2O, heat-denatured and mixed with 100 µl (100 µg)
of Photoprobe biotin (Vector Laboratories, Burlingame, CA). The resulting mixture was
irradiated with a 270 W sunlamp on ice for 20 minutes. Additional Photoprobe biotin
(50 µl) was added and the biotinylation reaction was repeated. After extraction with
butanol five times, the DNA was ethanol-precipitated and dissolved in 23 µl H2O to
form the driver DNA.

To form the tracer DNA, 10 µg breast tumor cDNA library was digested
with BamHI and XhoI, phenol chloroform extracted and passed through Chroma Spin-400
columns (Clontech). Following ethanol precipitation, the tracer DNA was
dissolved in 5 µl H2O. Tracer DNA was mixed with 15 µl driver DNA and 20 µl of 2 x
hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% sodium
dodecyl sulfate), overlaid with mineral oil, and heat-denatured completely. The sample
was immediately transferred into a 68 °C water bath and incubated for 20 hours (long
hybridization [LH]). The reaction mixture was then subjected to a streptavidin
treatment followed by phenol/chloroform extraction. This process was repeated three
more times. Subtracted DNA was precipitated, dissolved in 12 µl H2O, mixed with 8 µl
driver DNA and 20 µl of 2 x hybridization buffer, and subjected to a hybridization at 68 °C
for 2 hours (short hybridization [SH]). After removal of biotinylated double-stranded
DNA, subtracted cDNA was ligated into the BamHI/XhoI site of chloramphenicol
resistant pBCSK+ (Stratagene, La Jolla, CA 92037) and transformed into ElectroMax
E. coli DH10B cells by electroporation to generate a breast tumor specific subtracted
cDNA library.
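
The working concentrations implied by the hybridization mixtures above can be checked with a short calculation. The following Python sketch is illustrative only: the function name `final_concentrations` and the dictionary keys are hypothetical, and it assumes that the tracer and driver DNA solutions contribute volume but no buffer components.

```python
def final_concentrations(stock: dict, stock_volume_ul: float, other_volumes_ul: list) -> dict:
    """Dilute each component of a stock buffer into the combined reaction volume.

    Assumption: the other solutions (DNA in water) add volume only.
    """
    total_ul = stock_volume_ul + sum(other_volumes_ul)
    factor = stock_volume_ul / total_ul
    return {name: conc * factor for name, conc in stock.items()}


# 2 x hybridization buffer as stated in the text.
buffer_2x = {"NaCl_M": 1.5, "EDTA_mM": 10.0, "HEPES_mM": 50.0, "SDS_percent": 0.2}

# Long hybridization: 5 ul tracer + 15 ul driver + 20 ul of 2 x buffer.
long_hyb = final_concentrations(buffer_2x, 20.0, [5.0, 15.0])

# Short hybridization: 12 ul subtracted DNA + 8 ul driver + 20 ul of 2 x buffer.
short_hyb = final_concentrations(buffer_2x, 20.0, [12.0, 8.0])

print(long_hyb)
print(short_hyb)
```

In both cases the 20 µl of 2 x buffer makes up half of the 40 µl reaction, so each component ends at half its stock concentration (0.75 M NaCl, 5 mM EDTA, 25 mM HEPES and 0.1% sodium dodecyl sulfate), which is the 1 x condition.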

To analyze the subtracted cDNA library, plasmid DNA was prepared
from 100 independent clones, randomly picked from the subtracted breast tumor
specific library and characterized by DNA sequencing with a Perkin Elmer/Applied
Biosystems Division Automated Sequencer Model 373A (Foster City, CA). Thirty-eight
distinct cDNA clones were found in the subtracted breast tumor-specific cDNA
library. The determined 3' cDNA sequences for 14 of these clones are provided in SEQ
ID NO: 1-14, with the corresponding 5' cDNA sequences being provided in SEQ ID
NO: 15-28, respectively. The determined one strand (5' or 3') cDNA sequences for the
remaining clones are provided in SEQ ID NO: 29-52. Comparison of these cDNA
sequences with known sequences in the gene bank using the EMBL and GenBank
databases (Release 97) revealed no significant homologies to the sequences provided in
SEQ ID NO: 3, 10, 17, 24 and 45-52. The sequences provided in SEQ ID NO: 1, 2, 4-9,
11-16, 18-23, 25-41, 43 and 44 were found to show at least some degree of homology
to known human genes. The sequence of SEQ ID NO: 42 was found to show some
homology to a known yeast gene.

Data was analyzed using Synteni provided GEMTOOLS Software.
Twenty-one distinct cDNA clones were found to be over-expressed in breast tumor and
expressed at low levels in all normal tissues tested. The determined partial cDNA
sequences for these clones are provided in SEQ ID NO: 53-73. Comparison of the
sequences of SEQ ID NO: 53, 54, and 68-71 with those in the gene bank as described
above, revealed some homology to previously identified human genes. No significant
homologies were found to the sequences of SEQ ID NO: 55-67, 72 (referred to as JJ
9434,71 17), and 73 (referred to as B535S).

In a further experiment, cDNA fragments analyzed by DNA microarray
were obtained from two subtraction libraries derived by conventional subtraction, as
described above. In one instance the tester was derived from primary breast tumors. In
the second instance, a metastatic breast tumor was employed as the tester. Drivers
consisted of normal breast.

cDNA fragments from these two libraries were submitted as templates
for DNA microarray analysis. DNA chips were analyzed by hybridizing with fluorescent
probes derived from mRNA from both tumor and normal tissues. Analysis of the data
was accomplished by creating three groups from the sets of probes. These probe
groups are referred to as Breast Tumor/mets, Normal non-breast tissues, and
Metastatic breast tumors. Two comparisons were performed using the modified
Gemtools analysis. The first comparison was to identify templates with elevated
expression in breast tumors. The second was to identify templates not recovered in the
first comparison that yielded elevated expression in metastatic breast tumors. An
arbitrary level of increased expression (mean of tumor expression versus the mean of
normal tissue expression) was set at approximately 2.2.
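
The expression cutoff described above amounts to comparing a ratio of group means against a threshold. The following Python sketch shows that comparison under stated assumptions: the function name `overexpression_ratio`, the sample intensity values, and the use of "greater than or equal to" at the cutoff are all hypothetical, since the text gives only the approximate level of 2.2 and does not describe the internal calculation of the Gemtools software.

```python
from statistics import mean


def overexpression_ratio(tumor_values, normal_values):
    """Ratio of mean tumor expression to mean normal tissue expression."""
    normal_mean = mean(normal_values)
    if normal_mean == 0:
        raise ValueError("mean normal expression is zero; ratio is undefined")
    return mean(tumor_values) / normal_mean


def is_overexpressed(tumor_values, normal_values, threshold=2.2):
    """True when the ratio of means meets the threshold (assumed inclusive)."""
    return overexpression_ratio(tumor_values, normal_values) >= threshold


# Hypothetical signal intensities for two templates (arbitrary units).
template_a = {"tumor": [4.0, 6.0, 5.0], "normal": [2.0, 2.0, 2.0]}
template_b = {"tumor": [3.0, 3.0], "normal": [2.0, 2.0]}

for name, data in (("template_a", template_a), ("template_b", template_b)):
    ratio = overexpression_ratio(data["tumor"], data["normal"])
    print(name, round(ratio, 2), is_overexpressed(data["tumor"], data["normal"]))
```

With these made-up values, the first template has a ratio of 2.5 and passes the cutoff, while the second has a ratio of 1.5 and does not.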

In the first round of comparison to identify overexpression in breast
tumors, two novel gene sequences were identified, hereinafter referred to as B534S and
B538S (SEQ ID NO: 89 and 90), and six sequences that showed some degree of
homology to previously identified genes (SEQ ID NO: 74-79). Additionally, in a second
comparison to identify elevated expression in metastatic breast tumors, five novel
sequences were identified, hereinafter referred to as B535S (overexpressed in this
analysis as well as what was described above), B542S, B543S, P501S, and B541S (SEQ
ID NO: 73, and 91-94), as well as nine gene sequences that showed some homology to
known genes (SEQ ID NO: 80-88). Clones B534S and B538S (SEQ ID NO: 89 and 90)
were shown to be overexpressed in both breast tumors and metastatic breast tumors.

Example 2

GENERATION OF HUMAN CD8+ CYTOTOXIC T-CELLS THAT RECOGNIZE
ANTIGEN PRESENTING CELLS EXPRESSING BREAST TUMOR ANTIGENS

This Example illustrates the generation of T cells that recognize target
cells expressing the antigen B511S, also known as 1016-F8 (SEQ ID NO: 56). Human
CD8+ T cells were primed in vitro to the B511S gene product using dendritic cells
infected with a recombinant vaccinia virus engineered to express B511S as follows
(also see Yee et al., Journal of Immunology (1996) 157 (9):4079-86). Dendritic cells
(DC) were generated from peripheral blood derived monocytes by differentiation for 5
days in the presence of 50 µg/ml GMCSF and 30 µg/ml IL-4. DC were harvested,
plated in wells of a 24-well plate at a density of 2 x 10^ cells/well and infected for 12
hours with B511S expressing vaccinia at a multiplicity of infection of 5. DC were then
matured overnight by the addition of 3 µg/ml CD40-Ligand and UV irradiated at
100 µW for 10 minutes. CD8+ T cells were isolated using magnetic beads, and priming
cultures were initiated in individual wells (typically in 24 wells of a 24-well plate) using
7 x 10^ CD8+ T cells and 1 x 10^ irradiated CD8-depleted PBMC; IL-7 at 10 ng/ml was
added to cultures at day 1. Cultures were re-stimulated every 7-10 days using
autologous primary fibroblasts retrovirally transduced with B511S and the
costimulatory molecule B7.1. Cultures were supplemented at day 1 with 15 I.U. of IL-2.
Following 4 such stimulation cycles, CD8+ cultures were tested for their ability to
specifically recognize autologous fibroblasts transduced with B511S using an
interferon-γ Elispot assay (see Lalvani et al., J. Experimental Medicine (1997) 186:859-865).
Briefly, T cells from individual microcultures were added to 96-well Elispot
plates that contained autologous fibroblasts transduced to express either B511S or, as a
negative control antigen, EGFP, and incubated overnight at 37 °C; wells also contained
IL-12 at 10 ng/ml. Cultures were identified that specifically produced interferon-γ only
in response to B511S transduced fibroblasts; such lines were further expanded and also
cloned by limiting dilution on autologous B-LCL retrovirally transduced with B511S.
Lines and clones were identified that could specifically recognize autologous B-LCL
transduced with B511S but not autologous B-LCL transduced with the control antigens
EGFP or HLA-A3. An example demonstrating the ability of human CTL cell lines
derived from such experiments to specifically recognize and lyse B511S expressing
targets is presented in Figure 1.

Example 3 
SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on a Perkin Elmer/Applied
Biosystems Division 430A peptide synthesizer using FMOC chemistry with HBTU
(O-Benzotriazole-N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A
Gly-Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a
method of conjugation, binding to an immobilized surface, or labeling of the peptide.
Cleavage of the peptides from the solid support may be carried out using the following
cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol
(40:1:2:2:3). After cleaving for 2 hours, the peptides may be precipitated in cold
methyl-t-butyl-ether. The peptide pellets may then be dissolved in water containing
0.1% trifluoroacetic acid (TFA) and lyophilized prior to purification by C18 reverse
phase HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water
(containing 0.1% TFA) may be used to elute the peptides. Following lyophilization of
the pure fractions, the peptides may be characterized using electrospray or other types
of mass spectrometry and by amino acid analysis.
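
Because the peptides may be characterized by mass spectrometry, the expected mass of a synthetic peptide bearing the amino-terminal Gly-Cys-Gly sequence can be estimated beforehand. The following Python sketch sums standard monoisotopic residue masses and adds one water for the free termini. The function names and the one-residue example peptide are hypothetical, and the sketch assumes an unmodified, linear peptide made of the twenty common amino acids.

```python
# Standard monoisotopic residue masses (Da) for the twenty common amino acids.
MONOISOTOPIC_RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # H2O added for the free amino and carboxyl termini


def add_gcg_tag(peptide: str) -> str:
    """Prepend the Gly-Cys-Gly sequence to the amino terminus (one-letter code)."""
    return "GCG" + peptide.upper()


def monoisotopic_mass(peptide: str) -> float:
    """Monoisotopic mass of a linear, unmodified peptide in one-letter code."""
    try:
        residues = sum(MONOISOTOPIC_RESIDUE_MASS[aa] for aa in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unsupported residue: {err.args[0]}") from err
    return residues + WATER


tagged = add_gcg_tag("A")  # hypothetical one-residue peptide, for illustration only
print(tagged, round(monoisotopic_mass(tagged), 4))
```

The calculated value can then be compared with the observed electrospray mass, allowing for the charge state and adducts of the ion actually detected.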

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purposes of 
illustration, various modifications may be made without deviating from the spirit and 
scope of the invention.

CLAIMS

1. An isolated polypeptide comprising an immunogenic portion of a 
breast protein or a variant of said protein that differs only in conservative substitutions and/or 
modifications, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of: (a) 
nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

2. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 1.

3. An isolated polynucleotide molecule comprising a sequence provided 
in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94. 

4. An expression vector comprising a polynucleotide molecule according 
to any one of claims 2 and 3. 

5. A host cell transformed with the expression vector of claim 4. 

6. The host cell of claim 5 wherein the host cell is selected from the group 
consisting of E. coli, yeast and mammalian cell lines.

7. A pharmaceutical composition comprising the polypeptide of claim 1 
and a physiologically acceptable carrier. 

8. A vaccine comprising the polypeptide of claim 1 and a non-specific 
immune response enhancer. 

9. The vaccine of claim 8 wherein the non-specific immune response 
enhancer is an adjuvant.

10. A vaccine comprising a polynucleotide molecule of any one of claims
2 and 3 and a non-specific immune response enhancer. 

11. The vaccine of claim 10 wherein the non-specific immune response
enhancer is an adjuvant. 

12. A pharmaceutical composition for the treatment of breast cancer 
comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
comprising an immunogenic portion of a breast protein, wherein said protein comprises an
amino acid sequence encoded by a polynucleotide molecule comprising a sequence selected
from the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9,
11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88; (b) complements of said nucleotide sequences;
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

13. A vaccine for the treatment of breast cancer comprising a polypeptide 
and a non-specific immune response enhancer, said polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of: 
(a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68- 
71, and 74-88; (b) complements of said nucleotide sequences; and (c) sequences that 
hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

14. The vaccine of claim 13 wherein the non-specific immune response 
enhancer is an adjuvant.

15. A vaccine for the treatment of breast cancer comprising a 
polynucleotide molecule and a non-specific immune response enhancer, the polynucleotide 
molecule comprising a sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88;
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence
of (a) or (b) under moderately stringent conditions. 

16. The vaccine of claim 15, wherein the non-specific immune response 
enhancer is an adjuvant. 

17. A pharmaceutical composition according to claims 7 or 12, for use in 
the manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

18. A vaccine according to any one of claims 8, 10, 13 or 15, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

19. A fusion protein comprising at least one polypeptide according to
claim 1.

20. A pharmaceutical composition comprising a fusion protein according 
to claim 19 and a physiologically acceptable carrier. 

21. A vaccine comprising a fusion protein according to claim 19 and a 
non-specific immune response enhancer. 

22. The vaccine of claim 21 wherein the non-specific immune response
enhancer is an adjuvant. 

23. A pharmaceutical composition according to claim 20, for use in 
manufacture of a medicament for inhibiting the development of breast cancer in a patient.

24. A vaccine according to claim 21, for use in the manufacture of a
medicament for inhibiting the development of breast cancer in a patient.

25. A method for detecting breast cancer in a patient, comprising:

(a) contacting a biological sample from a patient with a binding agent 
which is capable of binding to a polypeptide, the polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby detecting breast cancer in the patient. 

26. The method of claim 25 wherein the binding agent is a 
monoclonal antibody. 

27. The method of claim 26 wherein the binding agent is a 
polyclonal antibody. 

28. A method for monitoring the progression of breast cancer in a patient,
comprising:

(a) contacting a biological sample from a patient with a binding agent that 
is capable of binding to a polypeptide, said polypeptide comprising an immunogenic portion 
of a breast protein, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; 

(b) determining in the sample an amount of a protein or polypeptide that 
binds to the binding agent; 

(c) repeating steps (a) and (b); and 

(d) comparing the amount of polypeptide detected in steps (b) and (c) to 
monitor the progression of breast cancer in the patient.

29. A monoclonal antibody that binds to a polypeptide comprising an
immunogenic portion of a breast protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, wherein said protein comprises an amino 
acid sequence encoded by a polynucleotide molecule comprising a sequence selected from 
the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45- 
52, 55-67, 72, 73, and 89-94; (b) complements of said nucleotide sequences; and (c)
sequences that hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

30. A monoclonal antibody according to claim 29, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

31. The monoclonal antibody of claim 30 wherein the monoclonal 
antibody is conjugated to a therapeutic agent. 

32. A method for detecting breast cancer in a patient comprising: 

(a) contacting a biological sample from a patient with at least two 
oligonucleotide primers in a polymerase chain reaction, wherein at least one of the 
oligonucleotides is specific for a polynucleotide molecule encoding a polypeptide comprising 
an immunogenic portion of a breast protein, said protein comprising an amino acid sequence 
encoded by a polynucleotide molecule comprising a sequence selected from the group 
consisting of nucleotide sequences recited in SEQ ID NO: 1-94, complements of said 
nucleotide sequences and sequences that hybridize to a sequence of SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers, thereby detecting breast cancer. 

33. The method of claim 32, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94.

34. A diagnostic kit comprising:

(a) one or more monoclonal antibodies of claim 29; and 

(b) a detection reagent.

35. A diagnostic kit comprising:

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a polynucleotide molecule comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
complements of said sequences and sequences that hybridize to a sequence of SEQ ID NO: 1, 
2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, or 74-88 under moderately stringent conditions; 
and 

(b) a detection reagent. 

36. The kit of claims 34 or 35 wherein the monoclonal antibodies are 
immobilized on a solid support. 

37. The kit of claim 36 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

38. The kit of claims 34 or 35 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 

39. The kit of claim 38 wherein the binding agent is selected from the 
group consisting of anti-immunoglobulins, Protein G, Protein A and lectins.

40. The kit of claim 38 wherein the reporter group is selected from the 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 

41. A diagnostic kit comprising at least two oligonucleotide primers, at 
least one of the oligonucleotide primers being specific for a polynucleotide molecule 
encoding a polypeptide comprising an immunogenic portion of a breast protein, said protein
comprising an amino acid sequence encoded by a polynucleotide molecule comprising a
sequence selected from the group consisting of nucleotide sequences recited in SEQ ID NOS: 
1-94, complements of said nucleotide sequences and sequences that hybridize to a sequence 
of SEQ ID NO: 1-94 under moderately stringent conditions. 

42. A diagnostic kit of claim 41 wherein at least one of the oligonucleotide
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule
comprising a sequence selected from SEQ ID NOS: 1-94.

43. A method for detecting breast cancer in a patient, comprising:

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide probe specific 
for a polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of 
a breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions; and 

(c) detecting in the sample a polynucleotide sequence that hybridizes to 
the oligonucleotide probe, thereby detecting breast cancer in the patient. 

44. The method of claim 43 wherein the oligonucleotide probe comprises
at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a sequence 
selected from the group consisting of SEQ ID NOS: 1-94. 

45. A diagnostic kit comprising an oligonucleotide probe specific for a
polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of a 
breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences, and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions.

46. The diagnostic kit of claim 45, wherein the oligonucleotide probe
comprises at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

47. Peripheral blood cells from a patient incubated in the presence of at 
least one polypeptide of claim 1, such that T cells proliferate, for use in the manufacture of a 
medicament for treating breast cancer in a patient. 

48. The blood cells of claim 47 wherein the incubation of the T cells is repeated one or
more times.

49. A composition for the treatment of breast cancer in a patient, 
comprising T cells proliferated in the presence of a polypeptide of claim 1, in combination 
with a pharmaceutically acceptable carrier. 

50. Antigen presenting cells incubated in the presence of at least one
polypeptide of claim 1, for use in the manufacture of a medicament for treating breast cancer 
in a patient. 

51. The cells of claim 50 wherein the antigen presenting cells are selected 
from the group consisting of dendritic and macrophage cells. 

52. A composition for the treatment of breast cancer in a patient, 
comprising antigen presenting cells incubated in the presence of a polypeptide of claim 1, in 
combination with a pharmaceutically acceptable carrier.

COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS
OF BREAST CANCER AND METHODS FOR THEIR USE 

ABSTRACT OF THE DISCLOSURE 

Compounds and methods for the treatment and diagnosis of breast cancer are 
provided. The inventive compounds include polypeptides containing at least a portion of a 
breast tumor protein. Vaccines and pharmaceutical compositions for immunotherapy of 
breast cancer comprising such polypeptides, or polynucleotide molecules encoding such 
polypeptides, are also provided, together with polynucleotide molecules for preparing the 
inventive polypeptides.

FIGURE 1B

Figure 1: Specific lytic activity of B511s-specific CTL clones 3-6-8 and 3-6-7 measured 
on autologous LCL transduced with B511s (filled squares) or HLA-A3 (open squares). 
Each data point is the average of triplicate measurements.

SEQUENCE LISTING

<110> Steven G. Reed 
Xu, Jiangchun 
Dillon, Davin 



<120> Compound for Immunotherapy and Diagnosis 
of Breast Cancer and Methods for Their Use 



<130> 26000.446C2



<160> 95 



<170> FastSEQ for Windows Version 3.0 



<210> 68 
<211> 301 
<212> DNA 
<213> Human 



<400> 68 

ttgtgttggg gttccctttt ccggtcggcg tggtcttgcg agtggagtgt ccgctgtgcc 60 

cgggcctgca ccatgagcgt cccggccttc atcgacatca gtgaagaaga tcaggctgct 120

gagcttcgtg cttatctgaa atctaaagga gctgagattt cagaagagaa ctcggaaggt 180 

ggacttcatg ttgatttagc tcaaattatt gaagcctgtg atgtgtgtct gaaggaggat 240 

gataaagatg ttgaaagtgt gatgaacagt ggggnatcct actcttgatc cggaanccna 300

c 301 



<210> 69 
<211> 301 
<212> DNA 
<213> Human



<400> 69 

tctatgagca tgccaaggct ctgtgggagg atgaaggagt gcgtgcctgc tacgaacgct 60 

ccaacgagta ccagctgatt gactgtgccc agtacttcct ggacaagatc gacgtgatca 120

agcaggctga ctatgtgccg agcgatcagg acctgcttcg ctgccgtgtc ctgacttctg 180 

gaatctttga gaccaagttc caggtggacn aagtcaactt ccacatgntt gacgtgggtg 240

gccagcgcga tgaacgccgc aagtggatcc agtgcttcaa cgatgtgact gccatcatct 300 

t 301 



<210> 70 
<211> 201 
<212> DNA 
<213> Human 



<400> 70 

gcggctcttc ctcgggcagc ggaagcggcg cggcggtcgg agaagtggcc taaaacttcg 60

gcgttgggtg aaagaaaatg gcccgaacca agcagactgc tcgtaagtcc accggtggga 120

aagccccccg caaacagctg gccacgaaag ccgccaggaa aagcgctccc tctaccggcg 180

gggtgaagaa gcctcatcgc t 201



<210> 71 
<211> 301 
<212> DNA
<213> Human



<400> 71 

gccggggtag tcgccgncgc cgccgccgct gcagccactg caggcaccgc tgccgccgcc 60 

tgagtagtgg gcttaggaag gaagaggtca tctcgctcgg agcttcgctc ggaagggtct 120

ttgttccctg cagccctccc acgggaatga caatggataa aagtgagctg gtacanaaag 180

ccaaactcgc tgagcaggct gagcgatatg atgatatggc tgcagccatg aaggcagtca 240

cagaacaggg gcatgaactc ttcaacgaag agagaaatct gctctctggt gcctacaaga 300

a 301 



<210> 72 
<211> 251 
<212> DNA 
<213> Human 



<400> 72 

cttggggggt gttgggggag agactgtggg cctggaaata aaacttgtct cctctaccac 60

caccctgtac cctagcctgc acctgtccac atctctgcaa agttcagctt ccttccccag 120 

gtctctgtgc actctgtctt ggatgctctg gggagctcat gggtggagga gtctccacca 180

gagggaggct caggggactg gttgggccag ggatgaatat ttgagggata aaaattgtgt 240

aagagccaan g 251 



<210> 73 
<211> 913 
<212> DNA 
<213> Human 



<400> 73 

tttttttttt tttttcccag gccctctttt tatttacagt gataccaaac catccacttg 60 

caaattcttt ggtctcccat cagctggaat taagtaggta ctgtgtatct ttgagatcat 120

gtatttgtct ccactttggt ggatacaaga aaggaaggca cgaacagctg aaaaagaagg 180

gtatcacacc gctccagctg gaatccagca ggaacctctg agcatgccac agctgaacac 240

ttaaaagagg aaagaaggac agctgctctt catttatttt gaaagcaaat tcatttgaaa 300 

gtgcataaat ggtcatcata agtcaaacgt atcaattaga ccttcaacct aggaaacaaa 360 

attttttttt tctatttaat aatacaccac actgaaatta tttgccaatg aatcccaaag 420 

atttggtaca aatagtacaa ttcgtatttg ctttcctctt tcctttcttc agacaaacac 480 

caaataaaat gcaggtgaaa gagatgaacc acgactagag gctgacttag aaatttatgc 540 

tgactcgatc taaaaaaaat tatgttggtt aatgttaatc tatctaaaat agagcatttt 600

gggaatgctt ttcaaagaag gtcaagtaac agtcatacag ctagaaaagt ccctgaaaaa 660

aagaattgtt aagaagtata ataacctttt caaaacccac aatgcagctt agttttcctt 720

tatttatttg tggtcatgaa gactatcccc atttctccat aaaatcctcc ctccatactg 780 

ctgcattatg gcacaaaaga ctctaagtgc caccagacag aaggaccaga gtttctgatt 840 

ataaacaatg atgctgggta atgtttaaat gagaacattg gatatggatg gtcagcccaa 900 

cacaatggaa ttc 913 



<210> 74 
<211> 351 
<212> DNA 
<213> Human 



<400> 74 

tgtgcncagg ggatgggtgg gcngtggaga ngatgacaga aaggctggaa ggaanggggg 60 

tgggtttgaa ggccanggcc aaggggncct caggtccgnt tctgnnaagg gacagccttg 120 

aggaaggagn catggcaagc catagctagg ccaccaatca gattaagaaa nnctgagaaa 180 

nctagctgac catcactgtt ggtgnccagt ttcccaacac aatggaatnc caccacactg 240

gactagngga nccactagtt ctagagcggc cgccaccgcg gtggaacccc aacttttgcc 300

cctttagnga gggttaattg cgcgcttggc ntaatcatgg tcataagctg t 351



<210> 75 

<211> 251 

<212> DNA 

<213> Human 



<400> 75 

tacttgacct tctttgaaaa gcattcccaa aatgctctat tttagataga ttaacattaa 60

ccaacataat tttttttaga tcgagtcagc ataaatttct aagtcagcct ctagtcgtgg 120

ttcatctctt tcacctgcat tttatttggt gtttgtctga agaaaggaaa gaggaaagca 180

aatacgaatt gtactatttg taccaaatct ttgggattca ttggcaaata atttcagtgt 240 

ggtgtattat t 251 



<210> 76 
<211> 251 
<212> DNA 
<213> Human 



<400> 76 

tatttaataa tacaccacac tgaaattatt tgccaatgaa tcccaaagat ttggtacaaa 60

tagtacaatt cgtatttgct ttcctctttc ctttcttcag acaaacacca aataaaatgc 120

aggtgaaaga gatgaaccac gactagaggc tgacttagaa atttatgctg actcgatcta 180

aaaaaaatta tgttggttaa tgttaatcta tctaaaatag agcattttgg gaatgctttt 240

caaagaaggt c 251



<210> 77 
<211> 351 
<212> DNA 
<213> Human 



<400> 77 

actcaccgtg ctgtgtgctg tgtgcctgct gcctggcagc ctggccctgc cgctgctcag 60 

gaggcgggag gcatgagtga gctacagtgg gaacaggctc aggactatct caagagannn 120

tatctctatg actcagaaac aaaaaatgcc aacagtttag aagccaaact caaggagatg 180

caaaaattct ttggcctacc tataactgga atgttaaact cccgcgtcat agaaataatg 240 

cagaagccca gatgtggagt gccagatgtt gcagaatact cactatttcc aaatagccca 300

aaatggactt ccaaagtggt cacctacagg atcgtatcat atactcgaga c 351



<210> 78 
<211> 1592 
<212> DNA 
<213> Human 



<400> 78 

gaattccatt gtgttggggc cctgggggcg gaggggaggg gcccaccacg gccttatttc 60 

cgcgagcgcc ggcactgccc gctccgagcc cgtgtctgtc gggtgccgag ccaactttcc 120

tgcgtccatg cagccccgcc ggcaacggct gcccgctccc tggtccgggc ccaggggccc 180 

gcgccccacc gccccgctgc tcgcgctgct gctgttgctc gccccggtgg cggcgcccgc 240

ggggtccggg gaccccgacg accctgggca gcctcaggat gctggggtcc cgcgcaggct 300

cctgcagcag gcggcgcgcg cggcgcttca cttcttcaac ttccggtccg gctcgcccag 360 

cgcgctgcga gtgctggccg aggtgcagga gggccgcgcg tggattaatc caaaagaggg 420

atgtaaagtt cacgtggtct tcagcacaga gcgctacaac ccagagtctt tacttcagga 480 

aggtgaggga cgtttgggga aatgttctgc tcgagtgttt ttcaagaatc agaaacccag 540

accaactatc aatgtaactt gtacacggct catcgagaaa aagaaaagac aacaagagga 600

ttacctgctt tacaagcaaa tgaagcaact gaaaaacccc ttggaaatag tcagcatacc 660

tgataatcat ggacatattg atccctctct gagactcatc tgggatttgg ctttccttgg 720

aagctcttac gtgatgtggg aaatgacaac acaggtgtca cactactact tggcacagct 780

cactagtgtg aggcagtgga aaactaatga tgatacaatt gattbtgatt atactgttct 840

acttcatgaa ttatcaacac aggaaataat tccctgtcgc attcacttgg tctggtaccc 900 

tggcaaacct cttaaagtga agtaccactg tcaagagcta cagacaccag aagaagcctc 960

cggaactgaa gaaggatcag ctgtagtacc aacagagctt agtaatttct aaaaagaaaa 1020

aatgatcttt ttccgacttc taaacaagtg actatactag cataaatcat tcttctagta 1080 

aaacagctaa ggtatagaca ttctaataat ttgggaaaac ctatgattac aagtaaaaac 1140

tcagaaatgc aaagatgttg gttttttgtt tctcagtctg ctttagcttt taactctgga 1200

agcgcatgca cactgaactc tgctcagtgc taaacagtca ccagcaggtt cctcagggtt 1260 

tcagccctaa aatgtaaaac ctggataatc agtgtatgtt gcaccagaat cagcattttt 1320

tttttaactg caaaaaatga tggtctcatc tctgaattta tatttctcat tcttttgaac 1380 

atactatagc taatatattt tatgttgcta aattgcttct atctagcatg ttaaacaaag 1440 

ataatatact ttcgatgaaa gtaaattata ggaaaaaaat taactgtttt aaaaagaact 1500

tgattatgtt ttatgatttc aggcaagtat tcatttttaa cttgctacct acttttaaat 1560

aaatgtttac atttctaaaa aaaaaaaaaa aa 1592



<210> 79 
<211> 401 
<212> DNA 
<213> Human 



<400> 79 

catactgtga attgttcttg actccttttc ttgacattca gttttcanaa tttccatctt 60 

tcttctggaa ctaatgtgct gttctcttga ctgcctgctg ggccagcatc cgattgccag 120 

ccagaaacgt cacactgccc aagatggcca ggtacttcaa ggtctggaac atgttgagct 180

gagtccagta gacatacatg agtcccagca tagcagcatg tcccaggtga aatataatcg 240

tgctaggagc aaaagtgaag ttggagacat tggcaccaat ccggatccac tagttctaga 300

gcggccgcca ccgcggtgga gctccagctt ttgttccctt tagtgagggt taattgcgcg 360

cttggcgtaa tcatggncat agctgtttcc tgtgtgaaat t 401 

<210> 80 
<211> 301 
<212> DNA 
<213> Human 



<400> 80 

aaaaatgaaa catctatttt agcagcaaga ggctgtgagg gatggggtag aaaaggcatc 60 

ctgagagagt tctagaccga cccaggtcct gtggcacact atacgggtca ggaggggtgg 120

aagacaggcc taagctctag gacggtgaat ctcggggcta tttgtggatt tgttagaaac 180 

agacattctt ttggcctttt cctggcactg gtgttgccgg caggtgggca gaagtgagcc 240

accagtcact gttcagtcat tgccaccaca gatcttcagc agaatcttcc ggtaatcccc 300

t 301



<210> 81 
<211> 301 
<212> DNA 
<213> Human 



<400> 81 

tagccaggtt gctcaagcta attttattct ttcccaacag gatccatttg gaaaatatca 60 

agcctttaga atgtggcagc aagagaaagc ggactacgca ggaacgggga gtttgggaga 120 

agctctcctg gtgttgactt agggatgaag gctccaggct gctgccagaa atggagtcac 180

cagcagaaga actgntttct ctgataagga tgtcccacca ttttcaagct gttcgttaaa 240 

gttacacagg tccttcttgc agcagtaagt accgttagct cattttccct caagcgggtt 300 



t 301



<210> 82 
<211> 201 
<212> DNA 
<213> Human

<400> 82

tcaacagaca aaaaaagttt attgaataca aaactcaaag gcatcaacag tcctgggccc 60 

aagagatcca tggcaggaag tcaagagttc tgcttcaggg tcggtctggg cagccctgga 120

agaagtcatt gcacatgaca gtgatgagtg ccaggaaaac agcatactcc tggaaagtcc 180

acctgctggn cactgnttca t 201

<210> 83 
<211> 251 
<212> DNA 
<213> Human 

<400> 83 

gtaaggagca tactgtgccc atttattata gaatgcagtt aaaaaaaata ttttgaggtt 60 

agcctctcca gtttaaaagc acttaacaag aaacacttgg acagcgatgc aatggtctct 120

cccaaaccgg ctccctctta ccaagtaccg taaacagggt ttgagaacgt tcaatcaatt 180

tcttgatatg aacaatcaaa gcatttaatg caaacatatt tgcttctcaa anaataaaac 240

cattttccaa a 251

<210> 84 
<211> 301 
<212> DNA 
<213> Human 

<400> 84 

agtttataat gttttactat gatttagggc ttttttttca aagaacaaaa attataagca 60

taaaaactca ggtatcagaa agactcaaaa ggctgttttt cactttgttc agattttgtt 120

tccaggcatt aagtgtgtca tacagttgtt gccactgctg ttttccaaat gtccgatgtg 180

tgctatgact gacaactact tttctctggg tctgatcaat tttgcagtan accattttag 240

ttcttacggc gtcnataaca aatgcttcaa catcatcagc tccaatctga agtcttgctg 300

c 301

<210> 85 
<211> 201 
<212> DNA 
<213> Human 



<400> 85

tatttgtgta tgtaacattt attgacatct acccactgca agtatagatg aataagacac 60
agtcacacca taaaggagtt tatccttaaa aggagtgaaa gacattcaaa aaccaactgc 120
aataaaaaag ggtgacataa ttgctaaatg gagtggagga acagtgctta tcaattcttg 180
attgggccac aatgatatac c 201

<210> 86 
<211> 301 
<212> DNA 
<213> Human 



<400> 86

tttataaaat attttattta cagtagagct ttacaaaaat agtcttaaat taatacaaat 60
cccttttgca atataactta tatgactatc ttctcaaaaa cgtgacattc gattataaca 120
cataaactac atttatagtt gttaagtcac cttgtagtat aaatatgttt tcatcttttt 180
tttgtaataa ggtacatacc aataacaatg aacaatggac aacaaatctt attttgntat 240
tcttccaatg taaaattcat ctctggccaa aacaaaatta accaaagaaa agtaaaacaa 300
t 301



<210> 87
<211> 351
<212> DNA
<213> Human

<400> 87

aaaaaagatt taagatcata aataggtcat tgttgtcaca acacatttca gaatcttaaa 60
aaaacaaaca ttttggcttt ctaagaaaaa gacttttaaa aaaaatcaat tccctcatca 120
ctgaaaggac ttgtacattt ttaaacttcc agtctcctaa ggcacagtat ttaatcagaa 180
tgccaatatt accaccctgc tgtagcanga ataaagaagc aagggattaa cacttaaaaa 240
aacngccaaa ttcctgaacc aaatcattgg cattttaaaa aagggataaa aaaacnggnt 300
aaggggggga gcattttaag taaagaangg ccaagggtgg tatgccngga c 351

<210> 88
<211> 301
<212> DNA
<213> Human

<400> 88

gttttaggtc tttaccaatt tgattggttt atcaacaggg catgaggttt aaatatatct 60
ttgaggaaag gtaaagtcaa atttgacttc ataggtcatc ggcgtcctca ctcctgtgca 120
ttttctggtg gaagcacaca gttaattaac tcaagtgtgg cgntagcgat gctttttcat 180
ggngtcattt atccacttgg tgaacttgca cacttgaatg naaactcctg ggtcattggg 240
ntggccgcaa gggaaaggtc cccaagacac caaaccttgc agggtacctn tgcacaccaa 300
301

<210> 89
<211> 591
<212> DNA
<213> Human

<400> 89

tttttttttt tttttttatt aatcaaatga ttcaaaacaa ccatcattct gtcaatgccc 60
aagcacccag ctggtcctct ccccacatgt cacactctcc tcagcctctc ccccaaccct 120
gctctccctc ctcccctgcc ctagcccagg gacagagtct aggaggagcc tggggcagag 180
ctggaggcag gaagagagca ctggacagac agctatggtt tggattgggg aagagattag 240
gaagtaggtt cttaaagacc cttttttagt accagatatc cagccatatt cccagctcca 300
ttattcaaat catttcccat agcccagctc ctctctgttc tccccctact accaattctt 360
taactcttac acaattttta tccctcaaat attcatccct ggcccaacca gtcccctgag 420
cctccctctg gtggagactc ctccacccat gagctcccca gagcatccaa gacagagtgc 480
acagagacct ggggaaggaa gctgaacttt gcagagatgt ggacaggtgc aggctagggt 540
acagggtggt ggtagaggag acaagtttta tttccaggcc cacagtctct c 591

<210> 90
<211> 1996
<212> DNA
<213> Human

<400> 90

tttttttttt ttttttatca aatgaatact ttattagaga cataacacgt ataaaataaa 60
tttcttttca tcatggagtt accagatttt aaaaccaacc aacactttct catttttaca 120
gctaagacat gttaaattct taaatgccat aatttttgtt caactgcttt gtcattcaac 180
tcacaagtct agaatgtgat taagctacaa atctaagtat tcacagatgt gtcttaggct 240
tggtttgtaa caatctagaa gcaatctgtt tacaaaagtg ccaccaaagc attttaaaga 300
aaccaattta atgccaccaa acataagcct gctatacctg ggaaacaaaa aatctcacac 360
ctaaattcta gcagagtaaa cgattccaac tagaatgtac tgtatatcca tatggcacat 420
ttatgacttt gtaatatgta attcataata caggtttagg tgtgtggtat ggagctagga 480
aaaccaaagt agtaggatat tatagaaaag atctgatgtt aagtataaag tcatatgcct 540
gatttcctca aaccttttgt ttttcctcat gtcttctgtc tttatatttt tatcacaaac 600
caagatctaa cagggttctt tctagaggat tattagataa gtaacacttg atcattaagc 660
acggatcatg ccactcattc atggttgttc tatgttccat gaactctaat agcccaactt 720
atacatggca ctccaagggg atgcttcagc cagaaagtaa agggctgaaa aagtagaaca 780
atacaaaagc cctcgtgtgg tgggaactgt ggcctcactc ttacttgtcc ttccattcaa 840
aacagtttgg cacctttcca tgacgaggat ctctacaggt aggttaaaat acttttctgt 900
gctattcagc cagaaatagt ttttgtgctg gatatgattt taaaacagat tttgtctgtc 960
accagtgcaa aaacattaca gatgtctggg ctaatacaaa aacacataag aatctacaac 1020
tttatattta atactctatt caaatttaac tcaaagtaat gcaaaataat tagaagtaaa 1080
aacttaattc ttctgagagc tctatttgga aaagcttcac atatccacac acaaatatgg 1140
gtatattcat gcacagggca aacaactgta ttctgaagca taaataaact caaagtaaga 1200
catcagtagc tagataccag ttccagtatt ggttaatggt ctctggggat cccattttaa 1260
gcactctcag atgaggatct tgctcagttg ttagactatc attagtttga ttaagcaact 1320
gaagtttact tcataaatta ctttttccta tatccaggac tctgcctgag aaattttata 1380
cattcctcca aaggtaagta ttctccaaag gtaagtattt gactattaac acaaaggcaa 1440
tgtgattatt gcataatgac actaaatatt atgtggcttt tctgttaggt ttataagttt 1500
tcaatgatca gttcaagaaa atgcagatca tatataacta aggttttaca ccagtggttg 1560
acaaactatg gcccacaggc taaacccagc ctccccttgt ttttataaat aagttttatt 1620
agacataacc acactcattc atttctgtat tgtgtatagc tgctttcacg ctatactagc 1680
agaactgaat agttgtgaca gagactgtat ggaccgtgaa gcataaatat ttaccatctg 1740
gcccattcta aaaaaagtgt gccaattcct ggtttacact aaaatataga gtttagtggg 1800
aagcctattt gaaatgtgtt ttttttaggg gctgtaatta ccaattaaaa ttaaggttca 1860
ggtgactcag caaccaaaca aaagggatac taatttttta tgaacaatat atttgtattt 1920
tatggacata aaaggaaact ttcagaaaga aaaggaggaa aataaagggg gaaagggacc 1980
caacacaatg gaattc 1996

<210> 91
<211> 911
<212> DNA
<213> Human

<400> 91

gccctttttt tttttttttt cttgtttaaa aaaattgttt tcattttaat gatctgagtt 60
agtaacaaac aaatgtacaa aattgtcttt cacatttcca tacattgtgt tatggaccaa 120
atgaaaacgc tggactacaa atgcaggttt ctttatatcc ttaacttcaa ttattgtcac 180
ttataaataa aggtgatttg ctaacacatg catttgtgaa cacagatgcc aaaaattata 240
catgtaagtt aatgcacaac caagagtata cactgttcat ttgtgcagtt atgcgtcaaa 300
tgcgactgac acagaagcag ttatcctggg atatttcact ctatatgaaa agcatcttgg 360
agaaatagat tgaaatacag tttaaaacaa aaattgtatt ctacaaatac aataaaattt 420
gcaacttgca catctgaagc aacatttgag aaagctgctt caataaccct gctgttatat 480
tggttttata ggtatatctc caaagtcatg ggttgggata tagctgcttt aaagaaaata 540
aatatgtata ttaaaaggaa aatcacactt taaaaatgtg aggaaagctt tgaaaacagt 600
cttaatgcat gagtccatct acatattttc aagttttgga aacagaaaga agtttagaat 660
tttcaaagta atctgaaaac tttctaagcc attttaaaat aagatttttt tccccatctt 720
tccaatgttt cctatttgat agtgtaatac agaaatgggc agtttctagt gtcaacttaa 780
ctgtgctaat tcataagtca ttatacattt atgacttaag agttcaaata agtggaaatt 840
gggttataat gaaaatgaca agggggcccc ttcagcagcc actcatctga actagtaatc 900
ccaacacaat g 911

<210> 92
<211> 1710
<212> DNA
<213> Human



<400> 92

tttttttttt tttttaactt ttagcagtgt ttatttttgt taaaagaaac caattgaatt 60
gaaggtcaag acaccttctg attgcacaga ttaaacaaga aagtattact tatttcaact 120
ttacaaagca tcttattgat ttaaaaagat ccatactatt gataaagttc accatgaaca 180
tatatgtaat aaggagacta aaatattcat tttacatatc tacaacatgt atttcatatt 240
tctaatcaac cacaaatcat ataggaaaat atttaggtcc atgaaaaagt ttcaaaacat 300
taaaaaatta aagttttgaa acaaatcaca tgtgaaagct cattaaataa taacattgac 360
aaataaatag ttaatcagct ttacttatta gctgctgcca tgcatttctg gcattccatt 420
ccaagcgagg gtcagcatgc agggtataat ttcatactat gcgaccgtaa agagctacag 480
ggcttatttt tgaagtgaaa tgtcacaggg tctttcattc tctttcaaag gaagatcact 540
catggctgct aaactgttcc catgaagagt accaaaaaag cacctttctg aaatgttact 600
gtgaagattc atgacaacat atttttttta acctgttttg aaggagtttt gtttaggaga 660
ggggatgggc cagtagatgg agggtatctg agaagccctt ttctgtttta aaatataatg 720
attcactgat gtttatagta tcaacagtct tttaagaaca atgaggaatt aaaactacag 780
gatacgtgga atttaaatgc aaattgcatt catggatata cctacatctt gaaaaacttg 840
aaaaggaaaa actattccca aagaaggtcc tgatacttaa gacagcttgc tgggtttgat 900
caaagcagaa agcatatact ttcaagtgag aaaacagcag tggcaggctt gagtcttcca 960
agcaatcaaa tctgtaaagc agatggttac tagtaagtct agttatggga gtctgagttc 1020
taactcatgc tgtgcttgct ggatttgctg gctcttttcc gctctctgtg atgctggact 1080
ggcttgtcag gtgacatgct ctcaaagttg tgactggact cgttgtgctg ccgggtgtac 1140
ctcttgcact tgcaggcagt gactactgtg attttgtagg tgcgtgtgct gccatcttgg 1200
cactgcagct ggattctctg ggtacgggtt ttgtcattga cacaccgcca ctcctgggag 1260
ctcctcctgc tccagtactt tgttccatag cctcctccaa tccagttagg gagcactggc 1320
^raggcaagc actcgccagc acacaccagc tccttcagag ggctgatgct ggtgcactgg 1380
ccatcagaga tgtatttggt ggaacgcagt tcccggcaac ccacttgaac ccgagtgttc 1440
cgatccagtc cagtgttact gaaatgcctg cctccatttc tggcttgatt caacgtgctg 1500
ttgctgctgg ggtgtgctgg aacaggttta accacatgtg aataaaggat ttctgtggca 1560
tcatttttaa aagccaaaca gcttttcatt aggatgcatg caaggggaag gagatagaaa 1620
tgaatggcag gaggaagcat ggtgagtaga ggatttgctt gactgaagag ctggttaatt 1680
cttttgcctc tgcccaacac aatggaattc 1710



<210> 93 
<211> 251 
<212> DNA 
<213> Human 

<400> 93 

cccaccctac ccaaatatta gacaccaaca cagaaaagct agcaatggat tcccttctac 60

tttgttaaat aaataagtta aatatttaaa tgcctgtgtc tctgtgatgg caacagaagg 120

accaacaggc cacatcctga taaaaggtaa gaggggggtg gatcagcaaa aagacagtgc 180

tgtgggctga ggggacctgg ttcttgtgtg ttgcccctca agactcttcc cctacaaata 240

actttcatat g 251 

<210> 94 
<211> 738 
<212> DNA 
<213> Human 

<400> 94 

cccttttttt ttttttttcc acttctcagt ttatttctgg gactaaattt gggtcagagc 60

tgcagagaag ggatgggccc tgagcttgag gatgaaagtg ccccagggag attgagacgc 120

aacccccgcc ctggacagtt ttggaaattg ttcccagggt tcaactagag agacacggtc 180

agcccaatgt gggggaagca gaccctgagt ccaggagaca tggggtcagg ggctggagag 240 

atgaacattc tcaacatctc tgggaaggaa tgagggtctg aaaggagtgt cagggctgtc 300

cctgcagcag gtggggatgc cggtgtgctg agtcctggga tgactcagga gttggcctgg 360

atggtttcct ggatccactt ggtgaacttg cagaggttcg tgtagacacc cggtctgttg 420

ggccgggcac aagggtaatc tccccaggac acgagtccct gcagggagcc attgcagacc 480 

acaggccccc cagaatcacc ctggcaggag tctctacctg ctttgtcacc ggcgcagaac 540

atggtgtcat ctatctgtct cgggtaagca tcctcgcacc ttttctgact tagcacgctg 600

atattcaagc actggaggac cttagggaag tgcacttggg ggctcttggt tgtcccccag 660 

ccagacacca agcactttgt cccagcagag ggacaatgag aggagacgtt gatgggtctg 720

acatctttag tgggacga 738 



